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I Abstract 
Consider a population of fixed size that evolves over time. At each time, the genealogical 
structure of the population can be described by a coalescent tree whose branches are traced 
back to the most recent common ancestor of the population. As time goes forward, the 
genealogy of the population evolves, leading to what is known as an evolving coalescent. We 
will study the evolving coalescent for populations whose genealogy can be described by the 
Bolthausen-Sznitman coalescent. We obtain the limiting behavior of the evolution of the 
time back to the most recent common ancestor and the total length of the branches in the 
tree. By similar methods, we also obtain a new result concerning the number of blocks in the 
Bolthausen-Sznitman coalescent. 

1 Introduction 

Consider a haploid population of fixed size n that evolves over time. The genealogy of the 
population at time t can be represented by a coalescent process (n(s), s > 0) taking its values in 
the set of partitions of {1, . . . , n}, which is defined so that integers i and j are in the same block of 
n(s) if and only if the ith and j'th individuals in the population at time t have the same ancestor 
at time t—s. The genealogical structure encoded by the coalescent process can also be represented 
as a tree Tn{t). The shape of this tree changes over time as the population evolves, leading to 
what was called in [23] an evolving coalescent. The associated tree- valued stochastic process, for 
infinite as well as finite populations, was constructed and studied by Greven, Pfaffelhuber, and 
Winter |19j . Depperschmidt, Greven, and Pfaffelhuber [TT] incorporated mutation and selection 
into the model. 

Rather than studying the full tree-valued process, one can follow the evolution of certain 
properties of the tree that are of interest. One such property is the time back to the most recent 
common ancestor (MRCA) of the population. This evolution can be described by a process 
{An{t),t > 0), where 

t — An{t) = sup{s : one individual at time s is the ancestor of all individuals at time t}. 

Note that An{t) is the height of the tree Tn{t)- The process {An{t),t > 0) increases linearly 
at speed one between jumps, and jumps downward when one of the two oldest families in the 
population dies out, causing a new MRCA to be established. One can also consider the process 
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{Ln{t),t > 0), where Ln{t) denotes the sum of the lengths of all branches in the tree Tn{t). This 
process is of interest because, assuming that mutations occur at a constant rate 9 along each 
branch of the coalescent tree, Ln{t) should be roughly proportional to the number of distinct 
mutations observed in the population at time t. 

A natural population model to consider is the Moran model [22j. In this model, the population 
size stays fixed at n. Changes in the population occur at times of a homogeneous Poisson process, 
and we will scale time so that these changes occur at rate n(n — l)/2. At the time of each such 
change, one of the n individuals is chosen at random to give birth to a new offspring, and 
independently one of the other n — 1 individuals is chosen at random to be killed. For any fixed 
t G M, the genealogy of the population follows Kingman's coalescent |20) . meaning that each pair 
of lineages merges at rate one and no other transitions are possible. An analogous construction 
for infinite populations can be carried out using the lookdown construction of Donnelly and Kurtz 
|14j . The associated evolving coalescent was studied by Pfaffelhuber and Wakolbinger [23j. They 
showed that the jumps of the process {A{t),t > 0) that follows the time back to the MRCA occur 
at times of a homogeneous Poisson process, but that the process {A{t),t > 0) is not Markov. 
They also calculated the distributions of some other quantities, such as the number of individuals 
in the population at time t that will have descendants in the population when the next MRCA 
is established and the number of individuals in the population that will become the MRCA of 
the population in the future. Delmas, Dhersin, and Siri-Jegousse [12] extended these results by 
considering also the distribution of the sizes of the two oldest families at time t. Simon and 
Derrida [32j did some further work related to the evolution of the MRCA for populations with 
genealogies governed by Kingman's coalescent, and considered correlations between the time back 
to the MRCA and a measure of genetic diversity. Pfaffelhuber, Wakolbinger, and Weisshaupt 
[21] studied the evolution of the total branch length. They showed that the sequence of processes 
(L„(t) — 21ogn,i G M) converges as n — ?■ oo in the Skorohod topology to a limit process which is 
a stationary process with infinite infinitesimal variance. 

Evans and Ralph |17j studied the dynamics of the time back to the MRCA in a population 
in which a single "immortal particle" produces offspring at times of a Poisson process, and 
descendants of the offspring eventually die out. In this setting, the process {A(t),t > 0) is a 
Markov process whose jump rates and stationary distribution can be calculated explicitly. An 
example of a process that fits into this framework is the a-stable continuous-state branching 
process conditioned on nonextinction with 1 < a < 2. 

The goal of the present paper is to determine the dynamics of the time back to the MRCA 
and the total branch length for populations whose genealogy is given by the Bolthausen-Sznitman 
coalescent. The Bolthausen-Sznitman coalescent, which was introduced in [6], is an example of 
a coalescent with multiple mergers \25\ |28] , in which it is possible for many lineages to merge at 
once. More precisely, the Bolthausen-Sznitman coalecent started with n blocks is a continuous- 
time Markov chain taking its values in the set of partitions of {1, . . . ,n} such that whenever the 
partition has b blocks, each transition that involves the merger of k blocks into one is happening 
at rate 




(1) 



This means that the total rate of all transitions when there are b blocks is 
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It is well-known (see, for example, [25\ ) that this process has the property of sampling consistency, 
meaning that if m < n, then the process restricted to the integers {!,... ,m} is a Bolthausen- 
Sznitman coalescent started with m blocks. 

The reason for focusing on the Bolthausen-Sznitman coalescent is that this coalescent pro- 
cess has arisen in a wide variety of settings. The Bolthausen-Sznitman coalescent has been 
shown recently to describe the genealogy of certain populations undergoing selection O [8]. The 
Bolthausen-Sznitman coalescent also describes the genealogical structure of Neveu's continuous- 
state branching process [5], certain Galton- Watson processes with heavy-tailed offspring distri- 
butions ^ilj, and Derrida's generalized random energy model [6], [7]. 

We describe the population model that we will study in subsection 1.1. In subsection 1.2, we 
state our main result concerning the dynamics of the time back to the MRCA. In subsection 1.3, 
we state our main result concerning the total branch length. By similar methods, we also obtain 
a new result concerning the number of blocks of the Bolthausen-Sznitman coalescent. We state 
this result in subsection 1.4. The rest of the paper is devoted to proofs. 



1.1 A population model 

We now define more precisely the population model that we will study in this paper. We assume 
that for all times t G M, there are exactly n individuals in the population, labeled by the integers 
1, . . . , n. Changes in the population occur at times of a homogeneous Poisson process on M with 
rate n — 1. At the time of such a change, one particle is chosen at random to give birth to a 
random number ^ of new offspring, with 

Then ^ of the n — 1 individuals who did not give birth are chosen at random to be killed, and 
the new individuals take over the labels of the individuals who were killed. 

We now give a representation of the genealogy of this population. For each s G M and t > 0, 
let n„(s, t) be the partition of 1, . . . , n such that i and j are in the same block of n„(s, t) if and 
only if the individuals at time s labeled i and j are descended from the same ancestor immediately 
before time s — t. We consider the population immediately before time s — t, rather than exactly 
at time s — t, to ensure that for each s G M, the process (n„(s,t),t > 0) is right continuous. 
As long as there is no change in the population at time s, the partition n„(s,0) consists of n 
singletons. However, if k new individuals are born at time s, then n„(s,0) will consist of one 
block of size k + 1 and n — k — 1 singleton blocks. 

Proposition 1. Fix s G M. Then (n„(s,t),t > 0) is the Bolthausen-Sznitman coalescent started 
with n blocks. 

Proof. With probability one, there is no change in the population at time s, and n„(s, 0) consists 
of n singletons. When ancestral lines are followed backwards in time, an event in which k — 1 
individuals are born becomes an event in which k randomly chosen lineages merge, because the 
ancestral lines of the k — 1 children merge with that of the parent. The rate of events in which 
k — 1 individuals are born is 
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where An,fc comes from ([T]). Therefore, {Iln{s,t),t > 0) follows the dynamics of the Bolthausen- 
Sznitman coalescent up to the time of the first merger. That the process continues to follow 
the dynamics of the Bolthausen-Sznitman coalescent after this time is a consequence of the ex- 
changeability of the population model and the sampling consistency of the Bolthausen-Sznitman 
coalescent. □ 

For s G M and t >0, let Nn{s, t) be the number of blocks of the partition n„(s, t). Let 

An{s)=mi{t : Nn{s,t) = 1}, 

which is the time back to the MRCA of the population and corresponds to the height of the tree 
Tn{s) that represents the genealogy of the population at time s. Let 

/•oo 

Ln{s) = Nn{s,t)li^N„{s,t)>l}, 

Jo 

which is sum of the lengths of all branches in the tree Tn{s). 
1.2 Time back to the MRCA 

We consider here how the time back to the MRCA of the population evolves over time. Proposi- 
tion 3.4 of |18) states that if Y has the exponential distribution with mean 1, then for each fixed 
t S M, we have 

An{t) - log log n ^ - log Y, (4) 

where =^ denotes convergence in distribution as n — )• oo. We are interested here in finding the 
limit of the stochastic processes {An{t),t > 0) as n — )• oo. 

We now construct the limit process {A{t),t G M) and its time-reversal {R{t),t G M) from a 
Poisson point process. The construction is similar to constructions in [16] and |T7]. Let be a 
Poisson process on M? with intensity A x i/, where A is Lebesgue measure and v{dy) = dy. Let 
M consist of the points {{t,y) : {—t,y) is a point of A^}. Note that M is also a Poisson process 
on with the same intensity as N. For each {t,x) G M^, define the wedges 

W{t, x) = {{s, y) £ : s < t and y > X + 1 - s}, 
W'{t, x) = {{s, y) £ : s > t and y > X + s - t}. 

Then let 

R{t) = sup{x : there is a point of A in W{t, x)}, 
A{t) = sup{x : there is a point of M in W'{t,x)}. 

Note that with this construction, both {R{t),t G M) and {A(t),t G M) are right continuous, and 
because of the relationship between N and M, we have A{t) = R{—t) for all t such that there is 
no point in M whose first coordinate is t. 

The figure below shows how (R{t),t G M) and {A{t),t G M) are constructed from the Poisson 
point process. The process {R{t),t G M) decreases linearly at speed one between jumps but jumps 
up to the level of any point of A^ that appears above it. That is, if (t, y) is a point of N and 
R{t—) < y, then R{t) = y. The process {A{t),t G M) increases linearly at speed one between 
jumps. When the trajectory of the process encounters a point of M at time t, the process jumps 
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Figure 1: The processes {R{t),t G M) and {A{t),t G M) 



downward to the highest level y such that there is a point of M on the diagonal half-line starting 
at {t,y) and extending upward and to the right. 

The following theorem is our main result concerning the time back to the MRCA for the 
evolving Bolthausen-Sznitman coalescent. This result is proved in Section [2] by considering the 
processes in reversed time and establishing convergence to {R{t),t > 0). Note that unlike in the 
case of Kingman's coalescent, the limit process is Markov. 

Theorem 2. As n ^ oo, the sequence of processes {{An{t) — loglogn),t > 0) converges in the 
Skorohod topology to {A{t),t > 0). 

Remark 3. For each fixed t >0, the event that R{t) < y differs from the event that there is no 
point of N in W{t, y) only on a set of probability zero. The probability that there is no point of 
N in W{t,y) is 



If Y has the exponential distribution with mean one, then P(— logl" < y) = ^(1" > e~^) = 
exp(— e~^). Thus, we see from the above construction that the processes {R{t),t G M) and 
{A{t),t G M) are stationary processes whose stationary distribution is the same as the distribution 
of — logy. This result also follows from Theorem [2] and equation 

Remark 4. The processes {R{t),t G M) and {A{t),t G M) are both examples of piecewise de- 
terministic Markov processes, a class of processes whose theory was developed by Davis [9| llOj. 
These processes are characterized by their deterministic behavior between jump times, which is 
linear drift for the processes {R(t),t G M) and {A{t),t G M), and their jump rates. The jump 
rates for {R{t),t G ffi) can easily be read from the Poisson process A^. The process {R{t),t G M) 
jumps away from x at rate e~^, and when it jumps away from x, the distribution of the location 
to which it jumps has density e^~^l|j^>j.}.. This means that the rate of jumps from x to y is given 




by q{x,y) = e yi 



{y>x}- 



To obtain the jump rates for {A{t),t € M), note that if A{t) = x then there is a point of M 
at (t + s,x + s) for some s > but no points above this diagonal line. Because the density of 
the intensity measure of M at (t + s,x + s) is e"^^"'"''-', we see that conditional on A(t) = x, the 
distribution of the time before the next jump is exponential with mean 1. That is, for all x G M, 
the process {A(t),t E M) jumps away from x at rate one, which implies that the jump times of 
{A{t),t E M) form a homogeneous Poisson process of rate one on M. If the process jumps away 
from X at time t, then the probability that it jumps below y is the probability that there is no 
point of M in the trapezoidal region {{s,z) : s >t and y + s — t<z<x + s — t}, which is 

/ e""" dzds) = exp(e~'^ - e"^). 

Jy+s-t J 

Differentiating with respect to y, we see that the rate of jumps from x to y is given by r(x, y) = 
exp(e-^ - - y)'^{y<x}- 

As a check on these formulas, let 7r(y) = e~^e~^ ^ for y E M, which is the density of the 
stationary distribution, obtained by differentiating the right-hand side of Then note that 
Tr{x)q{x,y) = ■K{y)r{y,x) for all x,?/ E ffi, as expected given that {R{t),t E M) and {A{t),t E ffi) 
are related by time reversal. 




1.3 Total branch length 

Theorem 5.2 of p!3] establishes that for each fixed t E 

(logn)^ / n n log log n 



Lnit) X, (6) 



n \ logn (logn)^ 

where, using 7 to denote Euler's constant, 

£'[e*"^] = exp ^ - + in log |n| 

= exp ^zn(l - 7) - J (1 - e*"""" + znxl||j.|<i})x"^ dx^ . (7) 

We consider here the stochastic process L„ = {Ln{t),t > 0). If there are no changes in the 
population between times t and t + s, then the tree Tn{t + s) is obtained by starting with the 
tree Tn{s) and then adding a segment of length s to each of the n branches. Consequently, the 
process Ln increases at speed n between jumps. However, if k individuals die at time t, then the 
tree 7^(t) is obtained from the tree 7^(t— ) by removing k of the branches, causing a downward 
jump in the process L„. Our main result concerning the dynamics of the total branch length for 
the Bolthausen-Sznitman coalescent is that the processes {Ln{t),t > 0), properly centered and 
scaled, converge to a stable process of Ornstein-Uhlenbeck type. 

We now review some facts about processes of Ornstein-Uhlenbeck type, which can be found 
in Chapter 17 of [30] . Suppose {Z{t),t > 0) is a Levy process such that 

£;[e^"^(*)] = exp Uaut - - t J (l " e'""^ + it^xl||^|<i|) i^idx)] . 

Given c > and a random variable ^(0), there is a unique process {X{t),t > 0) having paths 
that are almost surely right continuous with left limits such that 

X{t) = X{0) + Z{t) -c [ X{s) ds (8) 

Jo 
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almost surely. Following [30], we call {X{t),t > 0) the process of Ornstein-Uhlenbeck type 
generated by (a, b, u, c). As long as 

/ log l^l v{dx) < oo, 

J\x\>2 

the process {X{t),t > 0) has a unique stationary distribution fi, with characteristic function 

J e*"''/x((ij;) = exp liau - J (l - e*""^ + i«xl{|j.|<i}) p{dx)\ , (9) 

where /3 = 6/2c, a = {a + z^((l, oo)) — oo, — l)))/c, and for all Borel sets 

/'OO /'OO 

p{B) = - / iB(e-'^y)dsKdy) 

C J-oo JO 

(see Theorem 17.5 of [30]). In this case, if X(0) has distribution p, then the process {X{t),t > 0) 
is stationary. 

Theorem 5. Let v he the measure on M whose density with respect to Lebesgue measure is given 
by x~^l(_oo o)(^)- {L{t),t > 0) be a stationary process of Ornstein-Uhlenbeck type generated 
by {2 — 'J, 0,1^,1). As n ^ oo, the sequence of processes 

{lognf / / t \ _ n _ nlog logn \ ^ ^ ^ 
n \ "ylogny logn (logn)^ y ~ 

converges in the Skorohod topology to {L(t),t > 0). 

Remark 6. The stationary distribution of {L{t),t > 0) has characteristic function given by ([9]) 
with a = 1 — 7, /3 = 0, and p = which matches the right-hand side of d?]). To see this, observe 
that oo)) = and oo, 1)) = 1, so q = (2 — 7) — 1 = 1 — 7. Also, for all z > 0, 

/O /"OO />oo /- — ze" 

I 'i-(~oo -z]{(i~''y)y~'^ ds dy = / y~^ dy ds = - = iy{{-oo,-z]). 
00 Jo Jo J~oo Z 

Thus, the convergence implied by Theorem [5] for each fixed t is consistent with the result 



1.4 Number of blocks 

The techniques used to establish Theorem [5] can also be used to prove a new result about how the 
number of blocks of the Bolthausen-Sznitman coalescent changes over time. Because the number 
of blocks for other coalescents with multiple mergers has been studied in some depth (see, for 
example, [U [2]), we believe that this result may be of independent interest. 

Theorem 7. Let {Iln{t),t > 0) be a Bolthausen-Sznitman coalescent started with n blocks. Let 
Nn{t) be the number of blocks of Unit), and let 

nte~^ log logn\ 



n \ \ log n / 
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Let {S{t),t > 0) be a stable Levy process satisfying 




u\ + itu log \ u 



As n —7- oo, the sequence of processes {Xn{t),t > 0) converges in the Skorohod topology to 



2.1 Construction from random recursive trees 

Our proof of Theorem [2] makes use of a connection between the Bolthausen-Sznitman coales- 
cent and random recursive trees that was discovered by Goldschmidt and Martin [18]. Suppose 
ii, . . . ,in are disjoint subsets of N such that if 1 < i < j < n, then the smahest element of £{ 
is less than the smallest element of £j. A random recursive tree with vertices labeled , . . . , £^1 
can be constructed inductively as follows. The vertex labeled £i is the root. For k >2, once the 
vertices labeled £i, ■ ■ ■ ,£k-i have been placed in the tree, the vertex labeled £k is attached to a 
vertex chosen uniformly at random from those labeled 1, . . . , £k-i- 

Suppose e is an edge in the tree connecting vertices x and y, where x is closer to the root 
than y. We can cut the tree at the edge e by deleting the edge from the tree as well as the entire 
subtree below e. That is, we remove all vertices z such that the shortest path from the root to z 
goes through x. All integers that are in labels of vertices that are removed from the tree are then 
added to the label of x. When a random recursive tree is cut at a randomly chosen edge, the 
remaining tree is a random recursive tree on the new set of labels (see Proposition 2.1 of [18j). 

To establish the connection with the Bolthausen-Sznitman coalescent, start with a random 
recursive tree on n vertices, labeled with the integers l,...,n. Then to each edge, add an 
independent exponential random variable with mean 1, whose value gives the time at which the 
edge is cut. For all t >0, let n„(t) denote the partition of {1, . . . , n} such that i and j are in the 
same block of Unit) if and only if the integers i and j are in the same vertex label at time t. Then 
(n„(t), t > 0) is the Bolthausen-Sznitman coalescent started with n blocks (see Proposition 2.2 of 
|18|). Because the last transition always involves deleting an edge adjacent to the root, the time 
back to the MRCA for this Bolthausen-Sznitman coalescent is the maximum of the exponential 
random variables assigned to the edges adjacent to the root. In [18], Goldschmidt and Martin 
used this fact to prove (jl]). 

We now use recursive trees to construct the evolving Bolthausen-Sznitman coalescent in re- 
versed time. Note that the dynamics of an evolving Bolthausen-Sznitman coalescent in reversed 
time are the same as the dynamics of an ordinary Bolthausen-Sznitman coalescent, except that 
whenever k lineages are lost due to a merger, these lineages are replaced by k new lineages. For 
the purposes of studying the time back to the MRCA, the labeling of the lineages by the integers 
1, . . . ,n is unimportant, so we will use a different vertex labeling scheme in the recursive tree 
construction. 

To carry out this construction, begin with a random recursive tree having n vertices con- 
structed as above, and give every vertex the label zero. Add an independent exponential random 
variable with mean 1 to each edge to obtain the tree at time zero. The process evolves in time 
as follows. The edge labels decrease linearly at speed one. When an edge label hits zero at, say. 




2 Proof of Theorem [2] 
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time t, this edge is cut from the tree and ah vertices below the edge are removed. If this cut 
removes k vertices, then we replace these vertices by adding k new vertices to the tree, one at a 
time, to randomly chosen vertices of the existing tree. The k new edges are assigned independent 
exponential random variables. The k new vertices are given the label t, corresponding to the 
time when they are added to the tree. Then the process continues to evolve according to the 
same rules. 

By the result of Goldschmidt and Martin, we know that this process follows the dynamics of 
the Bolthausen-Sznitman coalescent up to the time of the first merger, in the sense that the time to 
the first transition is exponential with rate n — l = \n and the probability that the first transition 
eliminates k vertices is Xn,k+i/^n- To see that the same dynamics continue after the first merger, 
note that if the first cut causes k vertices to be removed, the remaining tree has the shape of 
a random recursive tree on n — A; vertices, while the edge lengths remain exponential random 
variables with mean 1 by the memoryless property of the exponential distribution. Consequently, 
when k more vertices are added according to the recursive procedure, the resulting tree has the 
shape of a random recursive tree on n vertices, and all of the random variables attached to 
the edges have the exponential distribution with mean 1. Thus, this process follows the same 
dynamics as the population process followed backwards in time, with each set of k lineages 
merging at rate Xn,k- 

Let Mn{t) denote the maximum of the exponential random variables assigned to the edges 
that are adjacent to the root at time t. Then t+M„(t) is the first time at which every vertex other 
than the root has a label greater than i, and Mn{t) corresponds to the time back to the MRCA 
of the population at time —t. Let Rn{t) = Mn{t) — log log n. Then, we see that > 0) 

has the same finite-dimensional distributions as t) — loglogn,t > 0), and the two processes 

would have the same law if the process {An{—t),t > 0) were modified at the jump times to make 
it right-continuous rather than left-continuous. Consequently, in view of the stationarity of the 
population process, to prove Theorem [21 it suffices to show that the processes {Rn{t),t > 0) 
converge in the Skorohod topology to {R{t),t > 0), and it is this result that we will show. 

2.2 A heuristic argument 

To understand heuristically why Theorem [2] is true, note that if Rnit) < z, then the process i?„ 
jumps above z only when a new vertex is attached to the root, and the random variable assigned 
to the new edge is greater than log log n -|- z. Because the number of blocks in the Bolthausen- 
Sznitman coalescent decreases by A; — 1 whenever k blocks merge into one, the rate at which 
blocks are being lost, and thus new vertices are being added to the tree, is 



As long as not too many vertices are cut away from the tree at once, the probability that a new 
vertex attaches to the root is approximately 1/n. The probability that the exponential random 
variable assigned to the new edge is greater than log log n + z is e~^/logn. Hence, the rate at 
which the process Rn jumps above z is approximately 



in agreement with the dynamics of the process {R{t),t > 0). The rest of the proof consists of 
making these ideas rigorous. 
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2.3 Lemmas pertaining to random recursive trees 

We prove here two lemmas related to random recursive trees that will be used later in the proof 
of Theorem [2j 

Lemma 8. For all e > and z € M, the probability that there exists u G [0, e] such that 
Rn{u) 7^ Rn{u—) and Rn{u) > z is at most e(l + 2e~^). 

Proof. The rate at which blocks are being lost in the Bolthausen-Sznitman coalescent is given by 

n 

En 
-<n\ogn. (10) 

k=2 

The rate of transitions in which at least half the blocks are lost is 

SO the probability that such a transition occurs by time e is bounded by e. 

By the construction, for each block that is lost due to a merger in the Bolthausen-Sznitman 
coalescent, a new vertex is added to the tree. As long as the merger causes at most half the 
blocks to disappear, the probability that each new vertex attaches to the root is bounded by 2/n. 
Furthermore, when a new vertex attaches to the root, the probability that it causes the process 
{Rn{t),t > 0) to jump above z is the same as the probability that an exponential random variable 
with mean 1 is greater than log log n + z, which is e~(^°^'°^""'"^-' = e~^/logn. Consequently, in 
view of ([To]) , the probability that the process {Rn{t),t > 0) jumps above z before time e is 
bounded by 

2 

e + ejn----. <e(l + 2e-^), 

n log n 

which implies the result. □ 

Lemma 9. Consider a random recursive tree with vertices labeled 1, . . . ,n. Let d^ be the depth 
of the vertex labeled k, which is the number of edges on the path from the root to k, and let 
Dn = di + ■ ■ ■ + dn- Then 



E\Dn]=n{ 

k=l 



■ h 

Also, there exists a positive constant C such that 

Var(D„) < Cn^. (12) 



Proof. We first prove (jlip by induction. Because the vertex labeled 1 is the root vertex, which 
has depth zero, clearly = 0, verifying (llip . Suppose (lllh holds for n = m — 1, where m > 2. 

Let Fm-i = cr{di, . . . , dm~i)- Recall that the random recursive tree can be constructed so that 
the vertex labeled m is attached to one of the previous m—1 vertices at random. Consequently, 
the level of the vertex labeled m is one greater than the level of a randomly chosen previous 
vertex, so 

E[dm\J^rn-l\ = H --, = H T (13) 

m—1 m — 1 
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and thus 



m — 1 

Therefore, using the induction hypothesis, 

- m—l 



k=l ^ ^k=l ^ 



which imphes that (jlip holds for all n G N. 
We next show by induction that 



Var(Z)„)<n^X;^ 



A;2 
fe=i 



(14) 



The result is clear when n = 1 because Var(Z?i) = 0. Suppose the claim holds for n = m — 1, 
where m>2. By (fT3|) . E[Dm\^m-i\ = 1 + rnDm-i/{m — 1), so using the induction hypothesis, 

Var(Z)„) = E[Var(Z)™|J-„_i)] + Var(E[An|-^m-i]) 

/ TTl 

= E[Var{dm\J^m-i)] + Var -Dm~i 

\m — 1 

E[dl] 



<E[E[dl\Tm-i]]+m^Yl u2 

k=l 



m 



2^m 

k=l 



Now (|14p follows by induction. 

It remains to bound E[d1]. Suppose k > 2. li dk = j > 2, then there is a sequence of numbers 
1 = iQ < ii < ■ ■ ■ < ij = k such that during the construction of the random recursive tree, vertex 
ii attaches to vertex for i = 1, . . . ,j. Because the vertex ii has a choice of — 1 vertices to 
which it can attach, the probability of this event is l/[(ii — 1) . . . (ij-i — l){k — 1)]. Thus, 

Pidk = j) = j:^ E ' 



k-l ^ (ii - 1) . . . - 1) 
(A;-l)(i-l)!' 



^ l)(i-l)!(§i-l) 



^ (l + logfcp-^ 
- [k-\){3-\)\' 
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Combining this bound with the trivial bound that P{dk = j) < 1 for j = 1,2 gives 

(1 + log f (1 + log ky-^ 



5 + 



1 ^ (7 - 11 



.^3(i-l)(j-2) (j-3)! 



9(l + bgfc)^ 
2(fc-l) ' 

<C(logA;)2 (15) 
for some positive constant C. Combining this bound with (jl4p gives (jl2p . □ 



2.4 Generator of the limit process 

From the Poisson process construction described in the introduction, it is clear that (R{t),t > 0) 
is a Markov process. Furthermore, it is easy to describe the transition semigroup of the Markov 
process. Suppose R{0) = x. Then for y > x — t, we have R{t) < y when there are no points in 
the Poisson process N above the line segment from (0, y + t) to (t, y). It follows that 

P{R{t) < y) = exp r - / / e-' dz ds) = exp ( - e-^(l - e"*)) , 

V Jo Jy+t-s ) 

and 

P{R{t) = X - t) = exp ( - e-(^-*)(l - e"*)). 

Let E = [— oo, oo), and for x,y £ E, let d{x, y) = |e^ — e^|. Then {E, d) is a complete separable 
metric space. Let Cq{E) be the set of continuous real-valued functions on [—00,00) that vanish 
at infinity with the norm ||/|| = sup^.^^; |/(x)|. Note that if / G Cq{E), then limj;_s._oo f{x) exists 
and equals /(— 00), and lima,_>oo f{x) = 0. If / G Co{E) and x £ E, define 

/•oo 

Ptfix) = / f{y)e-'-'^'-'-'^e-y{l - e"*) dy + f{x - t)e-'-'^-''^^-'-'\ (16) 

Jx-t 

so that E[f{R{t)y\ = Ptf{x) when i?(0) = x. Note that the definition of Ptf{x) makes sense 
when x = —00, in which case the second term is zero. It is easily checked that Ptf G Cq{E) and 
that Ptf f as t ^ 0. Consequently, {Pt)t>o is a Feller semigroup on Cq{E), and {R{t),t > 0) 
is a Feller process with semigroup {Pt)t>o- 

The following result characterizes the infinitesimal generator of the process {R{t),t > 0) and 
describes a core for the generator. We see from the form of the generator in (IITD that for y > x, 
the process jumps from 2; to y at rate e~^. See also chapter 26 of [10] for a full characterization 
of the domain of the extended generator. 

Lemma 10. Let A be the infinitesimal generator associated with {Pt)t>o- Let C be the collection 
of functions f that are constant on [—oo,z] for some z > —00 and have the property that f, f , 
and f" are in Co{E) when we define f and f" in the usual way on (—00,00) and set /'(— 00) = 
/"(— 00) = 0. Then for all f £ C and all x £ E, 

l-OO 

Af{x) = -fix) + / e-yifiy) - fix)) dy. (17) 

J X 

Furthermore, C is a core for A. 
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Proof. Fix f €z C. Choose a real number z such that / is constant on [—00,2;]. By (|16p . for all 

X £ E, 



Ptfix) - fix) (fix -t)- /(x))e-^"'^"*'(i-e-) 



1 



(/(y) - f{x))e'--'(^---'h-yil - e-*) dy. 



(18) 



x—t 



We need to show that the right-hand side of (jlSp converges to the right-hand side of (|17p uniformly 
in X as t — )• 0. Because /" is bounded, 



f{x-t)-f{x) 



-fix) 



uniformly in x as t — )• 0. Also, 

fix - t) - f{x) _ 



< 



/(x-t)-/(x) 



which tends to zero uniformly in x as t — >■ 0. Because 



--(i-e-')(i_e-t) 



< 1 - e" 



-e-2'(l-e-*) I 



+ e 



-e-!'(l-e-*) 



1 -e" 



we have 



(/(y) - f{x))e-y ( 1 - ^ ^ ) dy 



POO 




< 2||j 




J z 





which tends to zero uniformly in x as t — )■ 0. Likewise, 

7 r (/(y)-/(x))e-^"^(i-^"*)e-^(l-e-*)dy 
tends to zero uniformly in x as t —J- 0. Therefore, 

/•oo 

ifiv) - /(x))e-^-^(i-^"*)e-ni - e~*) ^ / e-^(/(y) - /(x)) 



x—t 



uniformly in x as t — t- 0. Equation (jl7p follows. 

It remains to show that C is a core for A. It is easy to see that C is dense in Cq{E). Suppose 
f & C, and choose z so that / is constant on [—00, z]. For all x < 2; + 



roo 

'tfix) = / /(y)e-^"^(i-'^-*)e-^(l - e"*) dy + f{z)e- 

J z 



■e-^(l-e-') 



Thus, Ptf is constant on [—00,2 + 1\. By differentiating the right-hand side of ()16p . we see that 
the first and second derivatives of Ptf are continuous and vanish at infinity. Thus, Ptf ^ C. It 
follows from Proposition 3.3 in Chapter 1 of [15] that C is a core for A. □ 
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2.5 Convergence of finite-dimensional distributions 

We will show here that the finite-dimensional distributions of the processes > 0) de- 

fined from random recursive trees in section 12.11 converge as n ^ oo to the finite-dimensional 
distributions of {R{t),t > 0). 

Let Gnit) be the cr- field generated by the shape of the random recursive tree on n vertices at 
time t and the exponential random variables attached to the edges adjacent to the root. That is, 
Quit) includes all the information about the tree at time t except for the values of the exponential 
random variables on the edges that are not adjacent to the root. Let J-n{t) = (y{Qn{s)-, < s < t). 
Note that Rn{t) is ^„ (i)-measurable for all t > 0, and thus the process {Rn{t),t > 0) is adapted 
to the filtration J^n = {J'n{t),t > 0). Because {Rn{t),t > 0) is right continuous, it follows that 
{Rn{t),t > 0) is J>i-progressive (see p. 50 of [15j). 

Fix a function f £ C. Let En = n~'^ for all n € N. Let 



Ut) = — r E[f{Rn{t + s))\Tn{t)] ds 

£n JO 



and 

Mt) = —E[f{Rn{t + En)) - f{Rnm^n{t)]. 

By dH), -Rn(O) converges in distribution to -R(O) as n — )• oo. By Theorem 8.2 in chapter 4 of [15] 
(see also parts (a) and (b) of Remark 8.3), to show that the finite-dimensional distributions of 
{Rn{t),t > 0) converge to those of {R(t),t > 0), it suffices to show that the following hold for all 
t > 0: 

sup sup E[\Cn{s)\] < OO, (19) 

n 0<s<t 

sup sup £'[|99n(s)|] < OO, (20) 
n 0<s<t 

Urn E[\Ut)-f{Rnm]=0, (21) 
lim E[\^n{t) - {Af ){Rnm] = 0. (22) 

Note that ([THD is obvious because |Cn('5)| < ||/|| for all s > 0. To show ([2U|) . choose z > — oo 
such that / is constant on [— oo,z]. For s > 0, let Jg be the event that there exists a time 
u G [s,s + en] such that Rn{u) 7^ Rn{u—) and Rn{u) > z. Because the process {Rn{s),s > 0) 
decreases at speed one between jumps, we have f{Rn{s + En)) — f{Rn{s)) < en||/'|| on the event 
Jg. Because the process {Rn{s),s > 0) is stationary, we have P{Js) = P{Jo) for ah s > 0. 
Therefore, by Lemma [8l 

\fnis)\ < -E[\f{Rn{s + En)) " f{Rn{s))\ljc] + LE[\f{R^{s + En)) " f{Rn{s))\U] 

<ii/'ii + f ii/iim) 

< ||/'|| + (2 + 4e- 



which proves (|20|) . 
Next, observe that 

\ut)-f{Rnm = - 

En 



£r, 



E[f{Rn{t + S)) - f{Rr,{t))\Fn{t)] ds 







<En\\f\\+2\\f\\P{Jt\Fn{t)). 
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Thus, by Lemma [8l taking expectations of both sides gives 

mnit) - fiRnitm < ^nll/'ll + 2||/||P(Ji) < en||/'|| + 2e„||/||(l + 26"^) ^ 

as n — )• oo, which gives (|2ip . It remains only to show (j22p . 



When the tree is cut, we call the event a small cut if fewer than n/(logn)^/^ vertices are 
removed as a result of the cut, and a large cut otherwise. We define the following five events. 
Recall that z has been chosen so that / is constant on [—00, z]. 

• Let Ai be the event that between times t and t + £n, there is a small cut at some edge 
not adjacent to the root, one of the new edges attaches to the root and is assigned a label 
greater than loglogn + z, and this is the only edge that attaches to the root between times 
t and t + En and gets a label greater than log log n + z. 

• Let A2 be the event that between times t and i + e„, there is a large cut during which one 
of the new edges attaches to the root and is assigned a label greater than log log n + z. 

• Let A3 be the event that between times t and t + En, there is an event in which the tree is 
cut at some edge adjacent to the root, and one of the new edges attaches to the root and 
is assigned a label greater than log log n + z. 

• Let A/j^ be the event that between times t and t + e„, two or more new edges attach to the 
root and are assigned labels greater than log log n + z. 

• Let = n n A§ n Al. 

The next lemma shows that A2, A^, and are unlikely to occur, which means that jumps of 
the process Rn between times t and t + will occur primarily on the event Ai. 

Lemma 11. We have 

P{A2UAsUA4) „ 
lim = U. 

n— >co 

Proof. To bound ^(^2), note that an event during which k — 1 vertices are removed from the 
tree corresponds to a transition in the Bolthausen-Sznitman coalescent in which k blocks merge 
into one. Such events happen at rate [^)Xn,k = n/[k{k — 1)] by ([3]). When such an event occurs, 
the expected number of vertices that reattach to the root is l/(n — /c + 1) + • • • + l/(n — 1). 
When a vertex reattaches to the root, the probability that its label exceeds log log n + z is 
g-(iogiogn+2) _ g-2/iogn. Thus, for sufficiently large n, 

n / n— 1 ^ 

n I \ - 1 



k(k — V]\ ^ 7'/logn 

fc=|-n/(logn)i/2+l] ^ ' ^j=n-k+l-'^ 

^ Ene'"' / L"^+i ra 2(fc - 1) n ( }_ 

- logn I ^ k(k-l)' n ^ k(k-l)[ ^ j 



e„e ''(loglogn + 4) 
logn 



(23) 
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To bound P{A3), note that an event in which the tree is cut at some edge adjacent to the root 
corresponds to a merger in the Bolthausen-Sznitman coalescent that involves the block containing 
the integer 1. By the sampling consistency of the Bolthausen-Sznitman coalescent, every other 
block merges with this block at rate A2 = 1. Consequently, the expected number of blocks that 
are removed from the tree between times t and t + En when the tree is cut at an edge adjacent to 
the root is en{n — 1). Provided that fewer than n/(logn)^/^ vertices are removed as a result of 
the cut, the probability that a given vertex reattaches to the root is at most 2/n. The probability 
that the label on the new edge exceeds log log n + z is e~^/logn as before. Thus, for sufficiently 
large n 



n 



< 



(24) 



log n log n 

To bound P(^4), note that there are two ways that ^4 can occur. Either the tree can be cut 
twice between times t and t + Sn, or two or more edges can reattach to the root after a single cut. 
Because cuts of the tree happen at times of a Poisson process of rate = n — 1, the probability 
that two or more cuts happen between times t and t + is at most e^(n — 1)^. If there is an 
event in which k — \ vertices are removed from the tree following a cut, there are (^^2^) pairs of 
vertices that could reattach to the root. On A2, for sufficiently large n, the chance that two given 
vertices reattach to the root is at most and each new edge independently has probability 

e~^/(logn) of having a label greater than log log n + z. Thus, 



rn/(log n)i/2] 

P{A^r\Al)<el{n-lf + en Yl 



n 



k=3 



4e- 



-2z 



k - 1\ 

k{k-l)\ 2 yn2(logn)2 



< 4(n - If + 



2e-2^e 



(logn)^/2 

The result follows from (|23l) . (I2il). and ([25)1 . 
Lemma 12. We have 



(25) 
□ 



for sufficiently large n. Furthermore, 



lim E 

n— >oo 



P{Ai) < 2e-'en 



—P{Ai\J^n{t))-l 



0. 



Proof. Recall that in our construction using trees, we labeled the vertices by the time in which 
they were added rather than using the vertex labels 1, . . . ,n. However, for the purposes of this 
proof, we will arbitrarily number the vertices at time t by the integers 1, . . . ,n, with the root 
being vertex 1. For k > 2, let dk be the depth of vertex k, which is the number of edges on the 
path from the root to k. Let Vk be the number of edges along the path from the root to k that are 
not adjacent to the root but that have at least n/(logn)^/2 vertices below them. That is, Vk is 
the number of edges e along this path not adjacent to the root such that if we cut the tree at the 
edge e, it would be classified as a large cut. Define Dn{t) = d2 + - ■ • + dn and Vn{t) = V2 + - • • + Vn- 
For each k = 2, . . . ,n, we will separately bound the probability that Ai occurs and that k is the 
vertex that reattaches to the root with a label of at least log log n + z. 

Note that there are dk — 1 — vt edges not adjacent to the root such that, if the tree were cut 
at that edge, the vertex labeled k would be removed from the tree and this would be a small cut. 
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The probability that one of these edges is cut before time t + £n is 1 — e~^"^^''^^~'"''\ which is 
between en{dk — 1 — Vk) — e^(dfc — 1 — Vk)"^ /2 and e„((ifc — 1 — Vk)- The probabihty that the vertex 
labeled k reattaches to the root is between 1/n and l/(n — n/ylogn), and the probability that 
the new edge label is at least log log n + z is e^^/logn. Therefore, 



P{Ai\Tn{t)) < Y,^n{dk-l-Vk) 



1 



k=2 



n — n/^/Togn logn 



< 



SniPnit) - (n - 1) - Vn{t))e- 

nlogn 
enDn{t)e~'' / yiogn 
n log n \ yiogn — 1 



\/logn 
Vlogn - 1 



(26) 



Because < nlogn by Lemma [U it follows that P{Ai) < 2e for all n large enough 

that v/logn/(Vlogn - 1) < 2. 

Suppose the vertex k is cut from the tree. The probability that vertex k and some other 
vertex both reattach to the root with new edge labels greater than log log n + z is at most 



(n-2) 



1 



n — n/ -y/log n J \ log n 



< 



2e 



~2z 



n(logn) 



for sufficiently large n. Also, the probability that there are two cuts to the tree before time t + £n 
is at most (n — Combining these observations, we get 



PiAi\Tnit)) 



k=2 



e^(4 - 1 - Vkf 



2e 



-22 



> 



nlogn n(logn)^ 
en{Dn{t) - (n - 1) - Vn{t))e-' elDn{tfe-' 2eMt)e-^' 



el{n-l)'' 



Combining 



n log n 
and ([27]) gives 



— P{Ai\Tn{t))-l 



< 


Dn{t) ^ 




n log n 



+ 



2nlogn n (logn) 2 

{n-l) + Vn{t) 



(27) 



nlogn 



+ 



Dn{t) 



1 



^ EnPnitf ^ 2Dn{t)e- 



+ 



(28) 



n log n \ \/log n — I J ' 2nlogn n(logn)2 

We need to show that the six terms on the right-hand side of (j28p tend to zero in expectation 
as n — )• oo. By Lemma [9] and the Cauchy-Schwarz Inequality, 



E 



Dn{t) 



nlogn 



1 



< 



1 



n log n 



E[\Dn{t) - E[Dn{t)\\] + 



E[Dn{t)] 



^ Var(Zj„(t))V^ ^ 
~ n log n 



E[Dn{t)] 



nlogn 



1 



nlogn 
> 



1 



(29) 



as n — >■ oo. Because i?[T^(t)] is at most n times the rate of transitions in the Bolthausen-Sznitman 
coalescent that cause at least n/(logn)-^/^ blocks to be lost, we have 



E[Vn{t)] <n Yl 

fc=[n/{logn)i/2+l] 



n 



An,/ 



n 



E 



n 



fc=[n/(logn)i/2+i] 



k{k-l) 



n, 
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from which it fohows that the expected value of the second term on the right-hand side of (j28p 
tends to zero as n — )• oo. Using Lemma [9] and the fact that e„ = n~^, it is easily checked that 
the expectations of the last four terms on the right-hand side of (I28p tend to zero as n — t- oo. □ 

Proposition 13. The finite- dimensional distributions of the processes {Rn{t),t > 0) converge as 
n ^ oo to the finite- dimensional distributions of {R{t),t > 0). 



0, 



Proof. Recall that it remains only to show (j22p . which is equivalent to showing that 
1 



lim E 

n— >oo 



-E[f{Rn{t + £„)) - f{RnmMt)] + f{Rn{t)) 



Ru{t) 



e-y{f{y)-Rn{t))dy 



where / is a function in C which is constant on [— oo, z\. We evaluate f{Rn{t + e^)) — f{Rn{t)) 
on the three disjoint events ^i, A2iJ A^U A4, and A^. 
By Lemma [m 



E 



— E[{fiRnit + en))-f{Rnitm 



< 



P{A2 U A3 U A4) ^ (30) 



as n — )• 00. Note that f{Rn{t+en)) = f{Rn{t)—£n) on A^ because either the process {Rn{t),t > 0) 
does not jump between times t and t + Sn, in which case Rn{t + £n) = Rn(t) — En, or else 
Rn(t + £n) < z, in which case f{Rn{t + = f{Rn{i) — £n) = f{z)- Therefore, using Lemmas 
[n] and nil 



E 



1 



— E[{f{Rn{t + En)) - f{Rn{t)))U,\J'n{t)] + f\Rn{t)) 



= E 
< E 
= E 



E 



( f{Rn{t) 



f{Rn{t)) 



e-n. 



+ f'{Rn{t))]lA,+ f'{Rn{mA^ 



f{Rn{t)-en)-f{Rnit)) 



+nRn{t)) 



+ \\f'\\P{Al) 



ds 



+ \\f'\\P{AiUA2UAsUA4) 



lRn{t)-e„ 

< ^nliril + ||/'||(^'(^l) + ^^(^2) + P{A,) + P{A,)) ^ 



(31) 



as n — )• 00. 

On Ai, there is a unique time r G [t,t + e„] such that at time r, a new edge attaches to the 
root and is assigned a label greater than loglogn-l-z. Denote by K the value of this label minus 
log log n, and let J = maxjiT, Rn{t)}. Conditional on Ai and J^nit), the distribution of K has a 
density given by k{y) = e^^^l^yyz}- Therefore, 

E[{f{Rn{t + En)) - f{Rn{t))lA, \J'n{t)] 

= E[{f{Rn{t + En)) - f{J))lA,\:Fn{t)] + E[{f {J) - f {Rn{t))l A,\Mt)] 

= E[{f{Rn{t + En)) - f{J))lA,\J'n{t)] + ( T e'^y {f {y) - f {Rn{t)) dy\p{A^\Tn{t)). 

\jRnit) J 

Note that J — En < Rn{t + En) < J OU ^i, SO 

\E[f{Rn{t + En)) - f{J))lAAJ'n{t)]\ < £„ || /' || P(Ai | J-„ (t)) . 
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It follows that 
E 



1 /"^ 

-E[f{R4t + en))- fiRnimA.lJ'nit)]- / e-^(/(y) - dy 

JRnit) 



<\\f\\P{A,) + E 



e-y{fiy) - f{Rn{t)) dy ) _ 1 



Ru{t) 



<||/'||P(^i) + 2||/||e-i? 



-P{Ai\Fn{t)) - 1 



(32) 



It now follows from Lemma [T2] that the right-hand side of (j32p tends to zero as n — J- cxd. The 
result now follows by combining this observation with pop and (I3ip . □ 

2.6 Tightness 

Here we show that the sequence of processes [Rn)'^=i is relatively compact. By Theorem 7.8 
in Chapter 3 of |15j , this result in combination with Proposition [13] implies that the processes 
{Rn{t),t > 0) converge in the Skorohod topology to {R{t),t > 0). 
For 6 > and t > 0, let 

Wn{S,t) = inf max sup |i?n(r) — i?„,(s)|, 



where the infimum is taken over all partitions of the form = to < < ' ' ' < = * such that 
ti — ti-i > (5 for « = 1, . . . , m. By Corollary 7.4 in Chapter 3 of [15], the sequence {Rn)'^=i is 
relatively compact provided that for all e > and t > 0, there exists J > such that 



limsupP(it;„((5, t) > e) < e. 



(33) 



Therefore, we can conclude the proof of Theorem [2] by showing (|33p . 

Fix t > 0. Choose e G (0, 1), and choose y < such that P(ii„(0) <y + t)< e/Q. Note that 
if ii„(0) > y + t, then -R„(s) > y for ah s <t. Let 

e2% 



36max{t, 1} 



Let < Tin < T2n < ■ ■ ■ denote the jump times of (^Rni^s^^s ^ 0). As long as tj^^ — Tj—\^n ^ ^ 
for all j such that rj^„ < t and there are no jump times in [0, (^] or [t — 5,t\^ it is easy to choose 
the times to, . . . ,tn such that 6 < ti — < 25 for i = 1, . . . , n and for all j such that rj^„ < t, 
we have Tj^n = ti for some i. That is, there is one of the ti at every jump time of the process. In 
this case, whenever r,s G [ti-i,ti), we have \Rnir) — Rn{s)\ = \r — s\ <2S < e. 
By Lemmas 1111 and 1121 with 6 in place of and y in place of z, we have 



lim sup P{Rn{s) / Rn{s-) for some s £ [0, 5]U[t- 5, t]) 



< limsupP(i?„(0) <y + t) + 2e-^5 + le'^d < 



(34) 



Note that if Tj-^„ — Tj-i^n ^ ^ for some j such that Tj^n ^ then there exists a nonnegative integer 
k < t/5 — 1 such that k5 < Tj_i „ < rj^„ < min{t, (A; + 2)5}. For two jumps of the process 
{Rnis),s > 0) to fall within the interval [k6,m.m{t, (k + 2)5}], one of the following four events 
must occur: 
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• We have Rnis) < y for some s G [0,t]. 

• Between times k6 and {k + 2)6, there is a large cut, and one of the new edges attaches to 
the root and is assigned a label greater than log log n + y. 

• Between times k6 and {k + 2)6, more than 35nlogn vertices are removed from the tree 
during small cuts. 

• Of the first [3(5nlognJ vertices, after time k6, that are removed from the tree during small 
cuts, two or more reattach to the root with new edge labels greater than log log n + y. 

We have already bounded the probability of the first event by e/6. The other three events 
depend on k. The probability of the second event tends to zero as n — )• oo by (j23|) with 26 in place 
of En and y in place of z. To bound the probability of the third event, note that mergers in the 
Bolthausen-Sznitman coalescent in which k — 1 blocks are lost occur at rate (^)Xn,k = n/{k{k — l)). 
Therefore, if N denotes the number of vertices removed during small cuts between times k6 and 
{k + 2)5, we have 

E[N] = 26 Yl (^-1)-Hr^ = 2'^ E ^<2<^^logn 

k=2 ^ ' k=2 

and 

rn/(logn)i/2] 

Var(iV) = 25 V {k - 1)^ • — ^ < 25n\ 
^ k(k — 1) 

k=2 ^ ' 

Therefore, by Chebyshev's Inequality, 



25n 

P{N > 36nlogn) < P{\N - E[N]\ > 5nlogn) < 



(5^72^ (log n)2 

as n — !• oo. Finally, concerning the fourth event, note that when a vertex is reattached after 
being removed during a small cut, the probability that it reattaches to the root is at most 2/n, 
and the probability that it is assigned a label greater than loglogn + y is e^^/(logn). Therefore, 
since there are at most (3(5n log n)'^/2 pairs of vertices to consider, the probability of the fourth 
event for a particular k is at most 

2 (nlogn)^ 

Since there are at most t/6 possible values of k to consider, the probability that the fourth event 
occurs for some k is at most 18t5e~^^ < e/2. Therefore, 

limsup P{Tj^n — 'Tj-i,n < ^ for some j such that rj^„ <t)<- + -< —. 
n— >oo D 2 o 

Combining this result with ([M]) gives ([33]) and completes the proof of Theorem [2l 
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3 Proof of Theorem [7] 



We obtain Theorems [5] and [7] using a very different approach. Rather than using recursive trees, 
we couple the population model with a family of stable processes by constructing both from a 
Poisson process. We describe this construction in section 13.11 We then prove Theorem [7] in 
section 13.41 and we prove Theorem O in section [H 

Throughout the rest of the paper, T > will be an arbitrary positive constant, and 

Tn = 2 log log n. 

Also, — )-p will denote convergence in probability as n — t- oo. 

3.1 A Poisson process construction 

Fix a positive integer n. Let be a Poisson point process on R x (0, oo) whose intensity mea- 
sure is given by dt x dy. Then define G to be the image of ^ under the map (t, y) i— )• 
(— t/logn, y/logn), restricted to M x (0,1]. That is, if {t,y) is a point of ^ with y < logn, then 
(— t/ logn, y/ logn) is a point of ^. Note that B is a Poisson point process on M x (0,1] with 
intensity measure dt x dy. 

We now construct a population model consisting of n individuals labelled 1, . . . ,n. We in- 
dependently attach to each point {ti,yi) of Q independent random variables Ui^i, . . . , each 
having the uniform distribution on (0, 1). If zero or one of the random variables C/j^i, . . . , Ui^n 
is less than yi, then there is no change in the population at time tj. However, if > 2 of the 
random variables Ui^i, . . . , Ui^n are less than y^ and the k smallest of these random variables are 
< ■ ■ ■ < Uiji^, then at time tj, the individuals labeled j2, ■ ■ ■ ,jk are killed, and the individual 
labeled ji gives birth to A; — 1 new offspring, which assume the labels j2, ■ ■ ■ ,jk- 

To see that this is equivalent to the population model described in the introduction, note 
that if {ti,yi) is a point of 0, the probability that exactly k new offspring are born at time ti is 
{k+i)yi~^^i^ — yi)'^~^~^. Thus, the rate of events in which exactly k new offspring are born is 

which matches ([2|) because changes in the population occur at rate n — 1. 

For s G M and t > 0, let Nn{s,t) denote the number of individuals in the population im- 
mediately before time s — t who have a descendant alive in the population at time s. That is, 
Nn{s, t) is the number of ancestral lines remaining after time t if we trace back the ancestral lines 
of the individuals in the population at time s. Note that because we consider the population 
immediately before time s — t rather than exactly at time s — t when defining Nn{s, t), the process 
{Nn{s,t),t > 0) is right continuous. Also, note that N{s,0) = n as long as there is no change in 
the population at time s, but if k individuals are killed at time s and replaced by new offspring, 
then A^(s,0) = n — k. Let Nn{t) = A^„(0,t). Because the genealogy of this population is given 
by the Bolthausen-Sznitman coalescent started with n blocks, the process {Nn{t),t > 0) has the 
same law as the process defined in the statement of Theorem [71 Let 

/"OO 

Ln{s)= A^„(s,t)l|^r„(^^i)>i} dt. (35) 

Jo 

Then Ln{s) is the total branch length for the coalescent tree representing the genealogy of the 
population at time s, so (-L„(s), s > 0) has the same law as the process considered in Theorem [3 
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Next, we use the Poisson process ^' to construct, for each s G M, a stable process {S{s, t),t > 0) 
with characteristic exponent 

^[gmS(s,t)] = exp - y |u| + itulog \u\^ 



(36) 



where 7 denotes Euler's constant. Because ^ has only countably many points, we can enumerate 
the points of ^ as {sj,Xj)JLi. If sj > —s, then the process {S{s,t),t > 0) will have a jump of 
size —Xj at time Sj + s. To make this construction precise, we use a standard approximation 
procedure that is described, for example, in Section I.l of Define 



-VTogri 



which implies that for all a > 0, 



and 



lim (logn)''e„ = 

n— >oo 



lim n"e„ = 00. 



Let Rn{s, t) be the set of all j such that —s < Sj < —s + 1 and Xj > e„. For all s G M and t >0, 
let 

Sn{s,t) =t{l -J -log En) - ^ Xj. (37) 

jeRn{s,t) 

Note that xl^^^^^r^^^iy x~^ dx = loge,i. Therefore, for any fixed T > and any integers 
m,n > N, the proof of Theorem 1 in Section I.l of [Ij (see the bottom of p. 14) gives 



E 



sup {Sn{s,t) - Sm{s,t)y 
0<t<T 



< AT / x\i,i<,^y dx < ATen. 
Jo 



(38) 



It follows that for each s G M, there is a limit process {S{s,t),t > 0) satisfying ()36p such that for 
each fixed T > 0, 



E 



sup {Sn{s,t) - S{s,t)y 
0<t<T 



< ATEn 



(39) 



as n oo. Let Sn{t) = 5'„(0,t) and S{t) = S{0,t) for all t > 0. 



3.2 Bounds on stable processes 

The three lemmas below collect some bounds on these stable processes that will be needed later. 
Lemma 14. Fix e > 0. Then there exists K > such that 



P[ sup sup e |5(s,t)| > K ] < e. 

,0<s<T t>0 



Additionally, we have 



lim pi sup sup |5'(s,t)| > 5(loglogri)^ ) = 0. 

n->oo \o<s<TO<t<T„+s 



(40) 



(41) 
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Also, 

lim P(|5(r„)|<r2) = l (42) 



and 



POO 

sup / e~^S{s,t) dt ^pO. (43) 

0<s<T Jt„+s 



Proof. The process {S{t),t > 0) is a stable process of index 1. Therefore, by Proposition 48.10 
of [30j, we have 

hmsupt-"|5(t)| = 

almost surely for all a > 1. Therefore, there exists a random time U with P{U < oo) = 1 such 
that \S{t)\ < for all t > U. This implies (02]). 

Let L = supo<t<[/ \S{t)\ and M = supo<f<T \S(T,t)\. Note that L < oo and M < oo almost 
surely. Suppose < s < T. By the construction, if < t < s then 

S{s,t) = S{T,T - s + t) - S{T,T - s), (44) 

while if t > s then 

S{s, t) = S{s, s) + S(0, t-s) = S{T, T) - S{T, T - s) + 5(0, t - s). (45) 

Therefore, 

\S{s,t)\ <2M + L + t'^. (46) 
Since e~*t^ < 1 for ah t > 0, it follows that 

sup sup e-*|5(s,t)| < 2M + L + 1. 

0<s<T t>0 

Now choose K large enough that P{2M + L + 1 > K) < e to obtain (l40]l . Furthermore, (|46]) 
implies that 

sup sup |S'(s,t)| < 2Af + L + (21oglogn + r)2, 

0<s<TO<t<T„+s 

and dH]) follows because P{2M + L + (2 log log n + r)^ > 5(loglogn)2) — )- as n — )■ oo. On the 
event that 2M + L < T^, which has probability tending to one as n — t- oo, equation (j46p gives 
that for < s < T, 

/ e-*\S{s,t)\dt<2 I e^H^dt^O 



as n — )• OO, which gives (j43|) . □ 
Lemma 15. VFe /laue 

sup sup \Sn{s,t) — S{s,t)\ 0. 

0<S<T 0<t<Tn+S 

Proof. Suppose < s < T. By (jli]) . which holds also with 5„ in place of S, if < t < s then 

\Sn{s, t) - S{s, t)\ = \Sn{T,T- s + t)- Sn(T, T - s) - S{T,T- s + t) + S{T,T- s)\ 
<2 sup \Sn{T,u) — S{T,u)\. 

0<u<T 
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By dlS]), if s <t <Tn + s, then 



\Snis,t)-Sis,t)\ 

= \Sn{T, T) - Sn{T, T-s) + Sn{0, t - s) - S{T, T) + S{T, T - s) - S(0, t - s) 
<2 sup \Sn{T,u) - S{T,u)\+ sup |S'„(0, u) - 5(0, n)|. 

Q<u<T 0<u<T„ 

Using that (a + 6)^ < 2a^ + 26^ and applying we get 



sup sup |S'„(s, t) — ^(s, t)| 

0<s<TO<t<T„+s 



< 32re„ + 8Tn£n ^ 



as n — )• c«, which imphes the result. 



□ 



Lemma 16. Suppose s G M and {6n)^=i is a sequence of numbers with 5„ylogn — as n — )• oo. 
Let 9 > 0, and let A{9) be the event that there are no points of ^ in [— s,— s + 5„] x [9,oo). Then 
for sufficiently large n, 



p( sup |5„(s,t)| >ei/3 

\0<t<Sn 



A{9)] <^Sn9^^\ 



Proof. By (j37j) . we have 



sup \S{s,t)\ < 6n(,l -7 -logSn) + ^ Xj 



Because | loge„| = \/log n and 5n\/\og n — )• 0, we have — 7 — log^n) < 6*^/^/4 for sufficiently 
large n. Furthermore, 













iG-Rn(s,t) 







(5„ / X ■ X dx = (5n(log 6* - log e„) 



as n — )• OO and 



Var(^ ^ x,A{9)^=6nj^ 



A{9) ] =6n I x^ ■ x-^ dx < 6n9. 



' je-Rn(s,i) 

Therefore, by Chebyshev's Inequality, for sufficiently large n, 



p( sup \Sn{s,t)\>9^/^ A{9)] < p( V Xj 



> 



j&Rn{s,t) 



A{9) 



< P 



<5r,.9 



E ^^-^ E 



j€Rn{s,t) 

4 



jeR„{s,t) 



Ai9) 



)l/3 



> 



02/3 / ' 



which gives the result. 



(47) 
□ 
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3.3 Rate of decrease in the number of blocks 



We record here some results about the rate at which the number of blocks decreases in the 
Bolthausen-Sznitman coalescent. Recall that the process (Nn{t),t > 0) tracks the evolution of 
the number of blocks over time. Because the number of blocks decreases by A; — 1 whenever k 
blocks merge into one, the rate at which the number of blocks is decreasing when there are b 
blocks is 

V{b) = j2(k-l)(l)xt,k. (48) 

k=2 ^ ^ 

Considering the process from the perspective of the construction in subsection 13. 1|, suppose there 
is a point {—t,y) in the Poisson process 0. If Nn{t—) = b, then Nn{t—) — Nn{t) is one less than 
the number out of b independent uniformly distributed random variables that are less than or 
equal to y, unless all of the random variables are greater than y in which case Nn{t—) — Nn{t) = 0. 
Therefore, the expected decrease in the process A'^^ at time t is 6y — 1 + (1 — y)^. It follows that 



riib) 



[\by-l + {l-y)')y-^dy. 
Jo 



We will be interested also in the process obtained by removing from the points whose 
second coordinate exceeds ^n/logn. In this case, the genealogy of the population is given not 
by the Bolthausen-Sznitman coalescent but by a different coalescent process in which the largest 
merger events are suppressed. In particular, when there are b blocks, the rate at which k blocks 
merge into one is given by 

!•£„/ logn 

Kk= x'^-\l - x)"-' dx (49) 







and the rate at which the number of blocks is decreasing is given by 

V*{b) = Y.(k-l)iAK,k= / {by-l + {l-y)')y-'dy. 



Also, let 



■w=E(*-i)^Q 



Kk- (50) 



k=2 

Lemma 17. For all positive integers b and n with 2 < b < n, we have 
I I log n 

\r]*{b) - 6(log6 - log log n + loge„ + 7 - 1) < h 1, (51) 



where 7 denotes Euler's constant. Also, 



v*ib) < p^. (52) 
logn 



Proof. Using ([18]) and we get 



k=2 
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Now 

7-1 



lim f - — log b] 

0<(,„g,-l„g(,-l))-l=£(i-l)<,.<^-l, 

°° / 1 1\ 

Z ^ / 



6-s> 

and 



so 



k=h+\ 

for all It follows that 

f£„/logn 



Jo 

= [ (6y - 1 + (1 - dy - f {by - 1 + {1 - yf)y-^ dy 

JO J Sn/ log n 

= vib) + 61og (-^) + C (1 - (1 - yf)y-^ dy. (54) 

\10gny ye„/logn 

Because the last integral is bounded by J^/\ogny~'^ ^2/ = {^ogn)/en, the result ([5T]) follows by 
combining (j53p and (|54p . 

To prove ([52|) . note that if {—t, y) is a point of and Nn{t—) = b, then because (/c — 1)^ < 2(2) 
for k > 2, the expected square of the decrease in the process Nn at time t is at most twice the 
expected number of pairs out of b independent uniformly distributed random variables having 
the property that both random variables are less than or equal to y, which is 2{^y^. Therefore, 

v*{b)< / 2 ':W■y-'dy<^'- 



J0 logra 
as claimed. □ 

3.4 The coupling 

In this section, we prove Theorem [71 We use the construction and notation of section [3TT1 As in 
the statement of Theorem [71 let 



for all t > 0. Also, let 



= '-^(nJ^)- ne- - a^^l}^^ ) (55) 
n \ \ log n J log n 



Yn{t) = e-'Sn{t) + 



2 
and 

Y{t) = e-'S{t) + '-f. 

We need to show that the processes {Xn{t),t > 0) converge to {Y{t),t > 0) in the Skorohod 
topology. Because 



E 



sup {Yn{t)-Y{t)y 

0<t<Tn 



< AT„e„. 
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as n — )• oo by ([39]) , it suffices to show tliat 



sup |X„(t)-y„(t)| ^pO. (56) 

0<t<T„ 

In fact, to prove Theorem [71 it would suffice to show (j56p with the arbitrary fixed constant T in 
place of T„, but it will be helpful for the proof of Theorem [5] to control the difference between 
Xfi and Yn up to time r„. 

Let < Ti < • • • < Tj^ < Tn be the jump times of the process {Sn{t),t > 0) before time 
T„. Let To = and tj„+i = T„. Note that the Tj depend on n even though we do not record 
this dependence in the notation. This means that there are points {Ti,yi) in ^ with yj > e„ for 
1 = 1,..., Jn- Also, the process G, which is used to construct the population process, contains the 
points {—Ti/ \ogn,yi/ logn) but G contains no points in the regions (— Tj+i/ log n, — r^/ log n) x 
[Sn/ iogn,l]. Therefore, conditional on ti,...,tj„, between times Tj/logn and Tj+i/logre, the 
process Nn follows the dynamics of the number of blocks in a coalescent process with transition 
rates given by (I49p . 

For i = 0,1, Jn and t £ [0, Tj+i — Ti), let 



Ti+t 



logn / \logn 



(Ti+t)/\ogn 



M,,n{t) = Nni ]-Mt^]+ / ri*iNn{s)) ds. (57) 



Tj/logn 



By standard results about compensators for Markov jump processes (see, for example. Theorem 
9.15 in [21]), the process (Mj^„(t),0 < t < Tj+i — Tj) is a martingale. Note also that Tj+i — Tj is 
exponentially distributed with mean e„ and is independent of the evolution of the process Mj^„ 
before time Tj+i — Tj. Next, for tj <t < Tj+i, let 

i-i 

Mn{t) = Y,Mi,n{{Ti+l -Ti)-) + Mj,n{t -Tj). (58) 
i=0 

Then the process (M„(t),0 <t< Tn) is a martingale. 
Lemma 18. We have 

^/logn \Mnit)\ > < iilogn)^Tn£n. 

Proof. By standard results about compensated Markov jump processes (see, for example. Corol- 
lary 9.17 of [25), 

" i-Tn/logn 

Yav{Mn{Tn)) = E / V*{Nn{t))dt 

-Jo 

where v* was defined in ([50]) . Therefore, by (152]) . 

Var(M„(r„)) < 

(logn)"' 

By the Maximum Inequality for martingales, 

4n Tn.Sn 



E 



sup Mn{tY 
0<t<T„ 



< 



(log n)2 
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Thus, by Markov's Inequality, 



P( 1^ sup |M„(t)| > =p( sup M„(t)2 > ) < 4(logn)Xen, 



as claimed. □ 
For < t < T„, let 

/ N / X logn,^ , , logn/ ^ f t \ ,^ / N / nte~*loglogn\ , ^, 

n n \ \logn J log n J 



Lemma [18] implies that 



sup ^^M„(t)|^pO 



o<t<T„ n 

as n — )• oo. Therefore, to show ([^5]) . it suffices to compare the processes and y^. 

For s > 0, let (rf , ), (t|, . . . , (rjs , y^) denote the points of ^ in [— s, T„] x [e„, oo), ranked 
so that < • • • < rjj, . Let Tq = — s and Tjs_^_i = Tn- Note that r^^ = Tj for f = 0, 1, . . . , J„ + 1. 
Also let 2/j = for i = 1 , . . . , J„ . Li the next Lemma, we show that several events hold with 
high probability. This result will allow us to assume that these events holds throughout much of 
the rest of the paper. 

Lemma 19. Let Ai^n be the event that < n^/^. Let A2^n be the event that 



5^ yj < (log i 
1=1 

Let A^^n be the event that t^-^ — rf < En log n for i = 0, 1, . . . , Jj. Let An = Ai^n H A2^n H A^^n- 
Then lim„^oo -P(^n) = 1- 

Proof. Note that 



E[Jl] = {Tn + T) y-^dy 



{Tn + T) 



En 



It follows from Markov's Inequality that 

{Tn + T ) 
n— 5>oo It ' c. 



hm sup P{A'{ „) < lim sup " , — - = 0, 



and thus lim„,_s>oo -P(^i,n) = 1- 
We have 

Tn+T 

logn 



P ( max yi > logn ]< {Tn + T) y dy 

\^<i<J^ J Jlogn 



as n — )• OO. Also, 



E 



"n /•logn 

yj max yj < log n = {Tn + T) I y ■ dy = {Tn + T) (log log n - log En) 



Because — loge„ = (logn)^/^ and thus {Tn + T)(loglogn — loge„)/(logn)^/^ — )• as n — )• oo, it 
now follows from Markov's Inequality that lim^-i^oo -P(^2,n) = 1- 
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To estimate P{A3^n), let ak = —T + (fee„ logn)/2 for /c = 0, 1, . . . , and let K = minjfe : ak > 
Tn}. Note that if t^-^ — rf > En logn, then some interval of the form [afe_i, Ofe] with 1 < k < K 
must not contain any of the points t^. The probability that [ak-i,ak] does not contain any of 
the tT is 

En log n 1 



exp ( -{ak- afc_i) f y~ 



= exp - 



It follows that for sufficiently large n, 



as n — > oo. Therefore, lim„_!>oo 



P{A 



3,n) 



1, which completes the proof. 



□ 



The next lemma shows that the processes and Zn typically jump by approximately the 
same amount at the jump times r^. 



Lemma 20. We have 

J n 



pUn^i £ |(^n(ri) - ^n(r-)) - {Yn{n) - r„(ri-)) | l{|x„(.,-)|<log 



> 



1 



'''^ " (logn) 1/8 

r„(logn)3/2 



< 



for sufficiently large n. 

Proof. Recall that (tj, yi) is a point of * for i = 1, ... , J„. Therefore, Sn{ri) — Sn{Ti—) = —yi, so 

Ynin) - Ynin-) = -e-^'yi. (60) 
Let Ti = Tn and yi = on {i > J„}, and let Qi = a{Ti, Nniiri/ \ogn)-),yi). Then on {i < J„}, 



logn 



-N„ 



logn 



= min{0, 1 - Bi}, 



(61) 



where, conditional on Qi, the distribution of Bi is binomial with parameters A^„((''"i/logn)— ) and 
yi/logn. Here Bi represents the number of lineages out of A^n((ri/logn)— ) that merge at time 
Ti/logn. We have Var(Bj|^j) < nyi/logn on {i < J„}, so by the Cauchy-Schwarz Inequality, 



E 



Bi-Nn 



log n J log n 



< E 



Bi - Nr, 



log n / log n 







1/2 / 


f 


Qi 











nyj 
logn 



1/2 



on {i < J„}. Multiplying both sides by l{i<j„}l{jyj<iogn}) which is ^j-measurable, and taking 
expectations gives 



E 



Bi - N„ 



Vi 



Mi<Jn}^{yi<logn} 



< n^/^P{i < Jn 



^logn J logn 

Summing over i and observing that A2^n H < Jn} C {yi < logn} for all i, we get 



E 



1=1 



B^-Nn 



Ti 



log n y log n 



< n 
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Thus, 



Also, 



Pi ^2,n n 



log n 



n 



J n 

E 

i=l 



Bi - iV. 



n 



log n J log n 



> 



(log n)i/2 



< 



r„(logn)3/2 



(62) 



n V log n 



Vil ^ V / ^ , -Ti , "-Tie ^Moglogn . _ 

Xn{Ti-) + ne ^^^3-;^^ )-e "y^ 



n \logn 

Xnin-) , Ti.e-^' log log n 



log n 



+ 



log n log n 

which on the event {|X„(Ti— )| < loglogn} is bounded by 2yj(loglogn)/logn. Thus, on ^2,71, 



J n 

E 

i=l 



Vi 



n V log n 



e 'Vi 



L{|X„(T,-)|<loglogn} 



< 



2 log log n 
log n 



Jn 



1=1 



2 log log n 
(log n) 1/4 • 



(63) 



By m, 



^ogn 

Zniji) - Zniji-) + —^Bi 



log n 



n 



Nr. 



logn 



Nr. 



and so on Ai < 



Jn 

Y. 

i=l 



Zn{n) - Znin-) H — ^—Bi 
n 



< 



logn 
log n 



+ B, 



< 



logn 



n 



n 



3/4 ■ 



Combining ([62]) . ([63]) . and we get 



(64) 



P( ^„ n i ^ |(Z„(r,) - Zr^in-)) + e-"'yi|l||x„(r,^ 



)|<loglogn} 



i=l 



> 



1 2 log log n log n ) \ T„ (log n)^/^ 



+ 



+ 



(logn)i/2 (logn)V4 ^3/4 
The result follows by combining this result with (I60p and using that 



for sufficiently large n. 



< 



1 2 log logn logn 1 
H , H ?T- < 



(logn) 1/2 (logn) 1/4 n3/4 - (logn)i/8 



(65) 



□ 



Lemmas 1211 and 1231 below pertain to the behavior of the processes and Z„ in between the 
jump times Tj. 

Lemma 21. IfO<h< Tj+i — Tj < logn and n is sufficiently large, then on {i < Jn}, we have 



Yniji + h)- Ynin) -he ^'(1 - 7 - log^n + Ti)+ / y„(s) ds 



^{5„{r,)<ilogn} 



< h logn. 
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Proof. By the construction of the process Sn, we have Sniu + h) — Sn{Ti) = — 7 — loge„). 
Using 0{h?) to denote an expression whose absolute value is at most h?, Taylor's Theorem gives 

Therefore, 



p-(Ti+h) f , u\2 p-nj-'i 

Ynir, + h)- y„(r,) = e-(^"+^)5„(r, + /i) - e-^'5„(r,) + ^^—^ ^ 

= {e-^' - he-^^ + 0{h^)){Sn{Ti) + /i(l - 7 - loge„)) - e-^^Sn{n) 

+ h( ne-^^ - ^rf e-^M + 0{h^'' 



he ^'(1 - 7 - loge„ + Ti)-h[e ''^Snin) + 



+ 0{h') - h^e-^^{l - 7 - loge„) + 0{h'){Sn{n) + h{l-j- loge„)). 



On the event that Sn{Ti) < ;jlogn, for sufficiently large n, the absolute value of the sum of the 
first two terms is bounded above by ^hlogn, while the absolute value of the sum of the last 
three terms is bounded above by i/i^logn. Therefore, for sufficiently large n, on the event that 
'S'ra(rj) < 4logn, we have 



Ynin + h)- Ynin) - he-^^{l - 7 - loge„ + n) + hYr^in) 



< -h^logn. 



(66) 



Also, for sufficiently large n, if < s < h, then \ Yn{Ti + s) — Yn{Ti)\ < log n + log n < s logn 
on the event that Sn{Ti) < \ logn because h < En logn < 1. Therefore, 



Yn{s) ds - hYnin) 



< I {Ynin + s) - Yn{n)\ ds < I slognds = -h^logn. (67) 



By combining (j66p and (j67p . we arrive at the statement of the lemma. 



□ 



Lemma 22. Suppose < h < Tj+i — Ti < e„logn and Ti < s < ti + h. If n is sufficiently large 
and \Xn{s)\ < log logn, then 



log n 



logiVn 



log n 



log(ne ^') 



< 



8n(log logn) 
log n 



(68) 



Proof. In view of (j55p . if Tj < s < Ti + h, then, using that h < e^logn < (log log n)/ log n for 
sufficiently large n and using the assumption that |X„(s)| < log logn, we get 



Nr, 



log n 



ne 



nXn(s) _„ nse * log logn 

+ ne H ; ne 



log n 



logn 



I „ ^,1 nse * (log logn) 

< ne""* - ne"^M H — ^-^ + 

' ' log n 

n log log n n log log n 

< nh-\ ; h 



nXn{s) 



log n 



< 



logn 
3n log log n 
log n 



log n 



(69) 
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for sufficiently large n. 

We consider two cases. First, suppose ne""^* > (4n log log n)/ log n. Because (j69]) implies that 
ne""^* > ^A^„(s/ log n), we get, using ([69]) and the fact that -^logx = l/x, 

-1 



Therefore, 



logiVr, 



Nn 



log n 



\ogn 



log(ne 



logiVr, 



< 



log n 



3n log log n 
log n 

log(ne~'^') 



< 



4 ylogn 



\2n log logn 



log n 



Next, suppose that ne""^* < (4n log logn)/ logn. To determine the value of A^n(s/(log n)) 
that maximizes the left-hand side of ()68p . we consider the function f{x) = x(logx — log a), where 
a > is a constant. Note that f'{x) = 1 -|-log(x/a), which is positive when x > a/e and negative 
when X < a/e. Thus, /(x) is negative when x < a and reaches its minimum when x = a/e, and 
f{x) is positive and increasing for x > a. We conclude that ([68]) will hold in general provided 
that it holds if ne~'^^/e or ne""^' + (3n log log n)/ log n is plugged in for A^„,(s/(log n)). Note that 
by (|69p. we need not consider larger values. In the former case, the expression that we get is 

4n log log n 



ne 



log 



ne 



log(ne 



ne 



< 



elogn 



In the latter case, since ne > ne = n/{logn)'^, the expression that we get is bounded above 
by 



7n log log n 
log n 



log 



7n log log n 



< 



logn 
7n log log n 
log n 



log 



n 



(log n)2 

log 7 + log log log n + log log n 



□ 



Thus, (j68p holds for sufficiently large n. 

Lemma 23. IfO<h< r,_|_i — Tj < logn and n is sufficiently large, then on {i < J„}, ?«e have 

m+h 



Znin + h)- Znin) - /ie-^'(l - 7 - loge„ + n) + / Xn{t) 



dt 



{ I (s) I < log log n V se [r, ,ri } 



^ ^2n \i/2 , 3/i log logn 

< /i (logn) / +— -— . 

(logn)^'"^ 

Proof. Throughout the proof, we will work on the event that i < Jn and |X„(s)| < log logn for 
all s £ [Ti,Ti + h]. By the definition ([59]) of 



Znin + h)- Znin 



log n 



n 



Tj + h 
log n 



log n 



M„(ri + /i)+M„(ri) 



ne + ne" 



n 



(tj + h)e ('^*+'') log log n nr^e log log n 



log n 



+ 



logn 



Using the definition of M„ in ([57|) and (j58|) . we get 



Znin + h)- Znin) 



logn 



(Ti+/i)/logn 



J n/ logn 



77*(iV„(t)) + (logn)(e~^' - e-(^>+'*)) 
+ (loglogn)(rie"^» - (r^ + /i)e" 



-(r^+/^)^ 



(70) 
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To estimate the integral, we need to estimate 7]*(Nn{t)). Equation (jSip . which holds for 
sufficiently large n even when 6 = 1 if we define ?7*(1) = 0, gives 

1 , loff n 

|r/*(iV„(t))-iV„(t)(logiV„(t)-loglogn + loge„ + 7-l)| < ^ + 1. (71) 

It follows from (|7ip and Lemma after making the substitution t = s/logn, that if Tj/logn < 
t < {Ti + h)/ log n, then 

\v*{N^{t)) - N^mogn- loglogn + loge„ + 7 - 1 - t,)| < 9^(^°gl"g^)' (72) 

log n 

for sufficiently large n. 
Using ([55]) . we have 

{Ti+h)/\ogn 2 rTi+h. / s \ 

Nn{t)dt = - / nJ- )ds 

1 r'~^'^ , nse~* loglogn nXn(s) , 
' ne'" H ; — + , ^ ' ds 



logn y^. logn logn 

logn (logn)"' ^ ' 



(log ny J^^ 

Combining this result with ([72]) and letting ^ denote an expression whose absolute value is less 
than 9/i(loglogn)^/(logn), we get 



logn 



(Ti+h) /logn 

r]*{Nn{t))dt 

J n/ logn 

(Ti+h)/ logn 



Nn{t) dt + i 

Ti/ logn 



(log n)(log n - log log n + log e„ + 7 - 1 - Ti 
n 

(logn - loglogn + log e„ + 7 - 1 - ri)(e"^' - e"(^'+'')) 
+ (loglogn)((Ti + l)e-^' - (r, + /i + l)e-(^"+''))) + / '^ X„(s) 

J Ti 

(-loglogn + loge„ + 7 - 1 - Tj)(log logn) , 1^^-^,; , , 1^„-(r,+/i)^ 



\ logn / 

Recall that logEri = — (logn)^/^. One can show using calculus that if g{x) = (x + l)e~^, then 
|5r'(x)| < 1 for ah x > 0. Consequently, we have |(ri + l)e"^' - (r^ + /i + l)e"(^»+''))| < /i. Also, 
because we are assuming |X„(s)| < loglogn for all s G [rjTj + /i], we have | j^';'^^ Xn{s) ds\ < 
/i log logn. It follows from these observations that the sum of the terms on the last two lines of 
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(j73p is bounded by 3/i(log log n)(log n) for sufficiently large n. Therefore, 

/ V*iNnit)) dt = (logn)(e-^> - e-(^-+'^)) + (loglogn)(rie-^' - {n + /i)e-(^>+'^)) 

^ -J Til log n 

fTi+h 

+ {\ogen + 7 - 1 - ri){e-^^ - e-(^»+'^)) + / ds + i\ 

where |^'| < 3/i(loglogn)(logn)~^/^. Combining this result with (|70p . we get 

Zn{Ti + h)- Zn{Ti) = (1 - 7 " loge„ + r,)(e-^' - e-(^»+^)) - P^'^ Xn{s) ds - C'. 

Because -loge„ = (logn)^/^ and |(e~^' - e~^'^'~^^^) - /ie"^*| < the result follows. □ 

Lemma 24. Define the events 

-Bi,n = I sup supe~*|5(s,t)| < ;^ loglognl. 
I o<s<r t>o ^ J 

-B2,n = < sup sup 15(5,^)1 < 5(loglogn) 

I 0<s<T 0<t<Tn+s 



-Bs.n = S sup sup \Sn{s,t) - S{s,t)\ < I 
I 0<s<TO<t<T„+s 

Let Bn = Bi^n n -B2,„ n B3,„. T/ien lim„^oo -P(-Bn) = 1- 

Proof. Note that -Bi^n is the complement of the event on the right-hand side of (j40p with ^ log log n 
in place of K, and i?2,n is the complement of the event in (j4ip . Therefore, by Lemma [TH we have 
lim„^oo P{Bi^n) = linin^oo P{B2.n) = 1. We also have lim„_^oo PiB^^n) = 1 by Lemma HS) □ 

Recall that the event An was defined in Lemma [191 The next lemma shows that when An 
and Bn occur, it is unlikely that the processes X„ and Yn are ever far apart before time Tn- In 
view of Lemmas [19] and [2H this result implies ()56p , and therefore Theorem [71 

Lemma 25. We have 

p(AnnBnn{ sup \Xnit)-Yn{t)\> —^—-.\) <M^^^+4{lognfTn£n 

V lo<t<T„ (logn)Vif'jy nV%„ 

for sufficiently large n. 
Proof. We claim that on the event 

f log n 1 

Anr\Bnr\{^— sup |M„(t)| < 



n o<t<T„ log n 



n I ^ |(Z„(ri) - Zn{Ti-)) - {Ynin) - Yn{Ti-))\l^\x„in-)\<loglogn} < (^qg^^l/s }' ('^^^ 

we have < 3/(logn)Vi6 for all t < r„, and therefore -y„(t)| < 4/ (log n) Vie 

for all t < T„ by (I59p . Thus, by Lemmas [T8l and [20l this claim implies the result. 
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To prove the claim, let C = mf{t < r„ : - Yn{t)\ > 3/(log n)Vi6}. We assume that we 

are working on the event in (j74p . and we must show that C is the infimum of the empty set, and 
thus that C = oo. On Bi^n H B^ n, we have 



1 e'H^ 1 

Yn{t) < -loglogn+^- + |y„(t) -y(t)| < -loglogn + 2 



(75) 



for all t <Tn. Therefore, if t < ^ and n is sufficiently large, then 

log n 



\Xn{t)\ < \Yn{t)\ + \Zn{t) - Yn{t)\ + ^|M„(t)| 

n 

1, , 3 1 

< -loglogn + 2+ — — + 

2 (log n) 1/1° logn 

< log log n. 



(76) 



Note also that Sn{t) < j log n for all t < Tn on i?2,n H B^^n for sufficiently large n, and Tj+i —Ti< 
En log n for z = 0, 1, . . . , J„ on A^^n- Therefore, Lemmas [21] and [23] imply that if Tj + /i < and 
h < Tj+i — Tj, then 



(z„(Ti + /i) - z„(Ti)) - (Ynin + h)- Ynin)) + 



Ti+h 



{Xn{t)-Yn{t))dt 



3/lloglogn ^1/2 , ,2i 

- Vu^ + ^ (log n) / + /i log 

(lognji/^ 



n. 



Since J^;''^^ |X„(t) - Z„(t)| < /i/(logn), it follows that 



(Z„(Ti + h)- Zn{Ti)) - {Yn{Ti + h) - Yn{Ti)) + 



Ti+h 



{Zn{t)-Yn{t))dt 



^ 4/ilog logn ^^^^ 
~ (logn) 



for sufficiently large n. 
Define the sets 





iG{0,l 




iG{0,l 




iG{0,l 



i G {0, 1, . . . , J„ + 1} : |Z„(ri) - Yn{Ti)\ < 
i G {0, 1, . . . , J„ + 1} : Z„(ri) - y„(r,) > 
i G {0, 1, . . . , J„ + 1} : Z„(ri) - Ynin) < 



1 



(logn)Vi6 j' 
1 



(logn)Vi6 J' 
1 

~(logn)i/i6 j- 



Clearly, we have /i U /2 U /a = {0, 1, . . . , + 1}. By right continuity and the fact that Zn{0) = 
^n(O) = 0, we have ( > = tq. Suppose that ( £ (rj,Ti+i) for some i £ Ii. Then, by (f77|l and 
the fact that Tj+i — Tj < e„ log n. 



l^n(C) - yn{0\ < \Zn{n) " Yn{n)\ + / |Z„(0 - Yn{t)\ dt + 

J Ti 

1 



4(C-Tj)log logn 
(log n)i/2 



< 



(logn) 1/16 



+ En log n 



(logn)i/i6 



+ 4e„(log n)^/^ log log n. 



(78) 
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a contradiction for sufficiently large n because \Zn{0-Yn{C)\ > 3/(logn)i/i6 by right continuity. 
Thus, if i G Ii, then ( > Tj+i. Therefore, if i £ Ii, then reasoning as in (j78|) and using the fact 
that we are working on the event (I74p . we get 

|Z„(r,+i) - y„(r,+i)| < |Z„(r,) - y„(T,)| + T""' - Y^{t)\ dt + -^""^^^ " 



(log n)i/^ 

+ |(Z„(ri+i) - Z„(rj+i-)) - {Ynin+i) - y„(ri+i-))| 

1 3 
- 7i TTTTT^' + ■ 7^ TmT' + '^^^ (1°S ")^^^ + 



(log n) 1/16 (log n) 1/16 ' ^ ^ to b ^ (log n) 1/8 

(log n)i/i6 

for sufficiently large n, which implies that ^ Tj+i- 

Next, suppose that i G l2- Let p = inf{t > Tj : Z„(t) < y„(t)}. Suppose p < min{(^, Tj+i}. 
Then < Z„(s) - Yn{s) < 3/(logn)i/i6 for ah s < p. Therefore by dZZ]), 

Znip) - Yn{p) > Znin) - y„(r,) - [\Zn{t) - Yn{t)) dt - ~ ^ 



(logn)i/2 (logn)i/8 

13 1 

- n uTTe 7^ tytxf - 4e„(logn)^/2 loglogn - — , (80) 

(logn)i/i6 (logn)i/i6 (logn)i/*^ 

which is positive for sufficiently large n, a contradiction. Therefore, for sufficiently large n, if 

i £ I2, then Z„ must stay greater than Yn from time Tj until after time min{C, Tj+i}. In particular, 

ii i £ I2 and C > Tj+i, then i + l £ /i U /2. By the same argument with the roles of Yn and Z„ 
reversed, ii i £ I3 and > Tj+i, then i + 1 £ IiU I3. It follows from these observations and (179]) 
that the only way we could have C G (''"ij ''"i+i] with i G /2 is if there exists j < i such that j £ Ii, 
j + IJ + 2, . . . ,i £ I2, and 

ZniO - YniO > Zn{T,+i) - Y^{t,+i) + \ 

(logn)^/^° 

However, if this is true, then Z„(t) > y„(t) for all t £ [tj^i, (]. Therefore, using (j77|) and the fact 
that we are working on the event in ()74p . we have 

i+l 

ZniO - YniO < Znirj+i) - Y^irj+i) + \{Zn{rk) - ZniTk-)) - (y„(rfc) - YniTk-))\ 

k=j+2 

4(C - Tj-i)(log log"-) 



+ 



(log n) 1/2 



< Z„(r,- + ij - YniTj + l) + "YTF H 7j SI/2 • 

(logn)i/* (logn)i/^ 

For sufficiently large n, the sum of the last two terms on the right-hand side is less than 
l/(log n)i/i^, a contradiction. Hence, we can not have ( £ {Ti,Ti^i] for i £ I2, and the same 
argument with the roles of Yn and Zn reversed establishes that we can not have C £ (rj,rj-|_i] 
with i £ I3. We conclude that C = 00, which completes the proof. □ 
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3.5 Extension to arbitrary starting times 



The result (|56|) pertains to the evolution of the number of lineages when we trace back the 
ancestral lines of the individuals in the population at time zero. Of course, by stationarity, 
the analogous result holds if we instead trace back the ancestral lines of the individuals in the 
population at some other time s > 0. However, to help with the proof of Theorem [5] in the next 
section, we will need a stronger version of this result that will make it possible to show that the 
approximation works well simultaneously at many times. The key to this result is that the events 
An and Bn were defined so that the bounds that hold on these events are valid simultaneously 
for all s e [o,r]. 

Recall that Nn{s,t) denotes the number of individuals in the population immediately before 
time s — t with a descendant alive in the population at time s. Let Nn{s, t) = Nn{s, t)l{Ar(s,j)>i}, 
which will be convenient because 

/•oo 

Ln{s)= / Nnis,t)dt. (81) 



V ( +^ ^°S^/^)\7 ( ^ * ^ _t nte Moglogn 
Xn[s, t) = Nn , -j - ne 



Let 

n \ \ log n log n J log n 

Also, let 

Yn{s,t) = e-'Sn{s,t)+ 2 . 

Note that X„(0, t) = for all t < (log n) inf{s : iV„(s) = 1} and y„(0, t) = Yn{t) for all t > 0. 

We have the following extension of Lemma [25j The result follows from the same argument 
that gives Lemma [251 We have replaced 4/(logn)^/^^ by 5/(logn)^/^^ to account for the error 
in replacing Nn{s,t) by 7V„(s,t). Indeed, it would be possible to use A/{logn)^/^^ + (logn)/n 
because \Nn{s,t) — Nn{s,t)\ < 1 for all s and t. Also, because an interval of time T„ + s rather 
than Tfi must be considered when adapting the proofs of Lemmas 1181 and [20l the bounds involving 
Tn have been replaced by bounds involving T„ + T. 

Lemma 26. For sufficiently large n, we have 

P(A„nB„n| sup \Xn{s,t)-Yn{s,t)\> 



0<t<T,,+s 



(logn)i/i6 



^ (r„+r)(l0gn)3/2 ^2^rr ^rj.. 

< Y/2 h4(logn) [Tn + T)er, 

n I En 



for each fixed s € [0, T]. 



4 Proof of Theorem [5] 

Recall that for each s S M, a process {S{s,t),t > 0) was defined in section [3TT1 Let J(s) = y if 
(— s, y) is a point of ^, and let J(s) = otherwise. For each s G M and t > 0, let 

Yis,t) = e~'S{s,t) + ^, 
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and then let 



L{s) = -J{s) + / Y{s,t) dt. 
Jo 



Note that P{J{s) = 0) = 1 for each s > 0, but the —J{s) term keeps the process (L(s),s > 0) 
right continuous at the jump times and thus ensures that {L(s), s > 0) has paths that are almost 
surely right continuous with left limits. To see this, note that if (— s,y) is a point of ^, then 
S{s+,t) = S{s,t) - y and thus Y{s+,t) = Y{s,t) - e'^y for ah t > 0. Therefore, 



poo poo 

/ Y{s+,t)dt= / Y{s,t)dt-y. 
Jo Jo 



The next result shows that {L{s), s >0) has the same law as the process defined in the statement 
of Theorem [SJ 

Proposition 27. Let v he the measure on M. whose density with respect to Lebesgue measure 
is given by x^^l(_oo,o)(^)- "^he process (L(s),s > 0) is a process of Ornstein-Uhlenbeck type 
generated by (2 — 7, 0, z/, 1). 

Proof. Fix t > 0. Note that if < s < t and n > 0, then S{s,u) = S{t,t - s + u) - S{t,t- s). 
We use the notation f{x) dSt{x) to denote the stochastic integral of f{x) with respect to the 
stable process {S{t,x),x > 0). If < s < t, then 

/•oo 

L{s) = 1 - J(s) + / e-"5(s, u) du 
Jo 

fOO / ft — S + U 



e-"fy^ dSt{x)\du 
1-J(s)+ / / e-"" du]dSt{x) 

Jt~s \ Jx-t+s J 



roo 

= 1-J(s) + / e-(^-*+^) (82) 

Jt-S 

where in the next-to-last step we used Fubini's Theorem for general stochastic integrals (see, for 
example. Theorem 45 in Part IV of [27] and the remark on p. 161 in [27] that the measure in 
that theorem can be taken to be a- finite rather than finite). Note that J(0) = almost surely, 
and almost surely Jq J(s) ds = for all t. Therefore, using Fubini's Theorem again and (|82p . we 
get 

pt poo poo 

L{t) - L(0) + / L{s) ds = t- J{t) + / e"^' dSt{x) - / e-(^~*) dSt{x) 
Jo Jo Jt 

pt poo 

+ / / e-^'^-'+'UStix) ds 

Jo Jt~s 



poo poo 

t-J{t)+ e-'^dStix)- e-^""-'^ dSt{x) 
Jo Jt 



00 ft 



Jo Jmax{0,t-x} 



t - J{t) + ^* (^e-^ + ^* e-(^-*+^) d^ dSt{x) 

+ (^e^^ - e-(^"*) + ^* e-(^-*+^) d^ dSt{x) (83) 
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for all t > almost surely. Because 

ft 



It-x 



and 



it follows that 



e 



Jo 



L{t) - L(0) + f L{s) ds = t- J{t) + [ dSt{x) = t - J{t) + S{t, t) 
Jo Jo 

for all t > almost surely. Therefore, if we define Z{t) = S{t, t) + 1 — J{t) for all t > 0, then 

L{t) = L(0) + Z{t) - [ L{s) ds, (84) 
Jo 

which is exactly ([8]) with c = 1 and L in place of X. 

By the symmetry of the construction in subsection 13. H the processes {S{0,t),t > 0) and 
{S{t,t) — J{t),t > 0) have the same law. Indeed, if one reflects the points of the Poisson process 
about the vertical axis, so that a point at (t, x) is moved to {—t, x), and then follows the procedure 
used to construct (S'(0,t),t > 0), one obtains the process {S{t,t) — J{t),t > 0) provided that 
J(0) = 0. Therefore, the process {S{t,t) — J{t),t > 0) is a stable process whose characteristic 
exponent is given by the expression in (p6|) . Since {Z(t),t > 0) differs from {S{t,t) — J{t),t > 0) 
only by the addition of a linear drift of rate 1, it follows that {Z{t),t > 0) is a stable process 
whose characteristic exponent is obtained by replacing 1 — 7 with 2 — 7 on the right-hand side of 
(|36p . This observation, combined with (I84j) . implies the result. □ 



Remark 28. The process (L(s),s > 0) is clearly stationary by construction. Thus, it follows 
from Propositon [27] and the theory of processes of Ornstein-Uhlenbeck type reviewed in the 
introduction that the distribution of L{0), and therefore the distribution of L{s) for any fixed 
s > 0, has a characteristic function given by the right-hand side of ([7]). To observe this result 
more directly, use the Integration by Parts Formula (see, for example. Corollary 8.7 of [2lJ) to 
write that almost surely 



roo 

L(0) = / Y{t)dt = l+ I e-^S{t)dt = l+ / e-^dS{t). (85) 
Jo Jo Jo 

The distribution of the stable integral on the right-hand side of ()85p can be evaluated using the 
theory developed in Chapter 3 of pQJ. In this case, we apply Proposition 3.4.1 of [29J with m 
being 7r/2 times Lebesgue measure, E = [0, 00), a = 7r/2, ^ = 0, /3 = —1, and f{t) = e~*. We 
get Uf = 7r/2, /3j = —1, and fij = — te~* dt = —1 to recover the result. 



By (ISTI) . we have 

(logn)^ / f s \ n n log log n 



l2 



(logn)^ 1 ~ / s ^ \ * "-^s * log log n 



n Jq log n \ log n ' log n / log n (log n 

Consequently, the following proposition will imply Theorem [5l 



/■OO 

dt= Xn{s,t)dt. 
Jo 
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Proposition 29. We have 



sup 

0<s<T 



Xn{s,t) dt - L(s) 



>p 0. 



To prove Proposition [291 we need to compare the integrals Xn{s,t) dt and Y{s,t) dt. 
Lemmas [30] and [32] below will show that it suffices to consider the integrals of X„ and Y up to 
time Tn + s, and Lemma [33] will allow us to replace Y hj Yn- 



Lemma 30. We have 



Proof. We have 



("OO 

sup / Y{s, t) dt — >p 0. 
o<s<r Jt„+s 

/•oo 

sup / e~^S{s,t) dt 
<s<T Jt„+s 



by ([13]) . Also, it is clear that 



oo g-t^2 

lim / — ^ dt = 0. 

n— >oo 



The result is immediate. □ 
Lemma 31. For all e > 0, there exists a positive constant C such that 



liminfPl Nn{ 7^ ) < TT^^ 1 > 1 - e. 
log n ) (log n) 



Proof. Let {Iln{t),t > 0) be a Bolthausen-Sznitman coalescent started with n blocks. There is a 
well-known method for constructing a random partition having the same distribution as n„(t). 
Let a = e~*. Then let Ji > J2 > . . . denote the points of a Poisson point process on (0, 00) 
whose intensity measure is given by x~^~'^ dx. Let S = X^Si "^i; ^^'^ divide the interval [0, 1] 
into disjoint subintervals of lengths Ji/ S, J2/ S, . . . . Let Ui, U2, ... be i.i.d. random variables 
having the uniform distribution on [0, 1]. Then n„(t) has the same distribution as the partition 
of { 1 , . . . , n} such that i and j are in the same block if and only if Ui and Uj land in the same 
subinterval of [0, 1]; see, for example, section 4 of [5j or section 2.4 of [25j. Therefore, Nn{t) has 
the same distribution as the number of blocks of n„(t). Thus, by (3.13) of [26] . 



ar(Q)r(n) 

Therefore, by Stirling's Formula, there exist positive constants Ci and C2 such that for all n > 2 
and t > 0, 

Now, take t = T„/(logn), so that a = e-^"/(i°s"). Because 

a<l--^+ " 



logn 2(logn)2' 
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we have 

< ne-^n+7^nV2iogn < ^^^^ 
(log ny 

for some positive constant C3. Combining ([SH]) and ([57|) . we get that 



^ C4n 



(log n) 



V log n ^ 

for some positive constant C4. The result now follows from Markov's Inequality. □ 
Lemma 32. We have 



o< 

Proof. It is easy to check that 



/•oo 

sup / X„(s, t) — )-p 0. 

<s<T Jt„+s 



\ogn f°° ( _^ ^ nte * log log n 



lim / ne"'^ H — ) = 0. 

Therefore, it suffices to show that 

logn ,y f s t , , 

sup / Nn , dt 0. 

o<s<T n Jt„^s \logn logn/ 

Note that 



logn logny \ logn^ 

because the number of individuals in the population immediately before time {s — t)/ log n who 
have descendants in the population at time will be at least as large as the number of individuals 
in the population immediately before time (s — t)/ log n who have descendants in the population 
at the later time s/(logn). It follows that 

logn ( s t \ , logn f°° ~ ( t-s\ , 

sup ^ / mJ- ,- cit<sup^/ Nnio,- )dt 

o<s<T n Jt„+s \logn logn J o<s<T n Jt„+s V logny 



n Jt„ V logn 
(logn^2 rco 



Nn{0,t)dt. (88) 

JTn/logn 

Conditional on A^„(0, T„/ log n) = m > 2, the distribution of fx' /iogn^n{0,t) dt is the same as 
the distribution of Lm{0). Let e > 0. By Lemma EH there is a positive constant C such that 
P(7V„(0,T„/logn) < Cn/(logn)^) > 1 — e for sufficiently large n. Conditional on the event that 
A''„(0, Tn/ log n) < Cn/ (log n)^, the distribution of j^^^ Nn{0, t) dt is stochastically dominated 
by the distribution of Lm{0) for m = [Cn/(logn)^J . Thus, by ([U]), there is a positive constant K 
such that 

/ f°° Kn \ 

P[ Nn{0,t)dt<- -3 >l-2e 

\Jt„/ logn (logn)V 

for sufficiently large n. Hence, the right-hand side of (|88p converges in probability to zero, which 
gives the result. □ 
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Lemma 33. We have 



sup / \Yn{s,t) -Y{s,t)\dt ^pO. 

0<s<tJq 



Proof. Note that 



\Yn{s,t) -Y{s,t)\dt = / e~^\Sn{s,t) - S{s,t)\ dt 







< sup \Snis,t) - S{s,t)\. (89) 

0<t<Tn+S 



Therefore, 



sup / \Yn{s,t) — Y{s,t)\ dt < sup sup \Sn{s,t) — S{s,t)\ 

)<s<tJo 0<s<TO<t<Tn+s 



rTn+s 

0<s<TJo 0<s<T 0<t<T„+s 

by Lemma [T5j □ 



Choose fixed times = sq < si < ■ ■ ■ < Sm = T such that l/(logn)^ < Sj+i — Sj < 2/(logn)^ 
for i = 0,1, . . . ,m — 1. This is clearly possible for sufficiently large n, and m < T(logn)^. Let 
e > 0, and let {—ui,yi),...,{—Uk,yk) denote the points of ^' in the region [— T, 0] x [e^,oo). 
Note that with probability one, there are only finitely many points of ^ in this region. For 
i = 0, 1, . . . , m — 1, let Gi be the event that none of the uj falls in [si, Sj+i], and let Hi be the 
event that exactly one of the uj falls in [si, Sj+i] and that this point uj is in {si, Sj+i). Note that 
almost surely none of the uj land on one of the points sq, . . . , Sm, and with probability tending 
to one as n — )• oo, no two of the points Uj fall in the same interval [sj,Si+i]. Consequently, we 
have 

lim pi n{GiUHi)) =1. (90) 

^ 1=0 ^ 

On Hi, let be the value of j such that Uj £ (sj, Sj+i). 
Lemma 34. We have 

rTn+s rTn+Si+i 



lim sup P I sup sup 







Yn{s,t)dt- / YniSi+i,t)dt 



> 4e < ATe. 



Proof. If < s < Sj+i and t > 0, then 

Sn{s, t) = Sn{Si^l,t + Sj+i — s) — 5'„(Sj+i, Sj+i — s). 
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Therefore, 

rTn+s 



Ynis,t) dt 







Tn+S 



e ^Snisi+i,t + Si+i - s) dt 



e 5„(si+i, Sj+i - s) + 



dt 



10 JQ 







T„+s 



e-*5„(si+i,t) dt-e'^+^- 
e~*5'„(si+i, Si+i - s) + 







e Sn{si+i,t)dt 
dt 



y„(s,+i,t)dt + (e^'+i-^-l) 







+ 1 



e ^Sn{si+i,t) dt 



e Sn{si+i,t)dt 



Tn+S 



e ^^^(sj+ijSi+i - s) dt 
(91) 



We must bound the last four terms on the right-hand side of (|9ip . Now e*'+^ — 1 < 3/(logn)^ 
for sufficiently large n, so 



Si+i- 



1) 



e *Sn{si+i,t) dt 



< 



< 



(logn)2 
3 

(log n)2 
+ 



1 + 



1 + 



Tn+S 



y„(si+i,t) dt 



T'n,-\-Sj. 



\Yn{Si+i,t)-Y{si+ut)\ dt 



Y{si+i,t)dt 



i+1 



+ 



Y{s^+i,t)dt 



3 / fTn+S 

<- ^ 1+ sup / \Yn{s,t)-Y{s,t)\dt 

(log nj^ \ 



+ sup 

0<s<T 



o<s<r7o 

oo 

Y{s,t) dt 

T„+s 



+ sup \L{s)\ + sup \J{s)\ ]. (92) 

0<s<T 0<s<T 



Also, 6^^*+^ < 2 for sufficiently large n, which means 



e *S'„(si+i,t) dt 



<2(si+i-Sj) sup \Sn{Si+l,t)\ 

0<t<Si+i~Si 

4 



< 



(logn)2 o<t<s,+i-s. 



sup |S'„(si+i,t)|. 

l+l— 



Likewise, 



rT„+s 

/ e~*5n(si+i,Si+i - s) dt 
^0 



< sup \Sn{Si+l,t)\. 

0<t<Si+i-Si 



(93) 



(94) 
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Finally, 



Tn+Si + l 



dt 



< 



(log n) 



(95) 



for sufficiently large n. Combining (|92p . (j93p . (|94p . and ()95p with (|9ip. we get that for sufficiently 
large n, 



sup sup 

0<j<m— 1 s£[si,Si^i] 



Tn+S l-Tn+Si+i 

Yn{s,t)dt- / Yn{Si+i,t)dt 
JO 



4 3 

< 71 To +2 sup sup \Sn{Si+l,t)\ + 

(lognj^ 



0<j<m— 1 0<t<Si-|-i — Si 



(log n)2 



sup / \Ynis,t)-Y{s,t)\dt 
o<s<tJo 



+ sup 

0<s<T 



Y{s,t) dt 



T„+s 



+ sup \L{s)\ + sup I J(s) 

0<s<T 0<s<T 



(96) 



Note that supo<5<^ | J(s)| is almost surely finite by properties of the Poisson process and 
supo<s<T 1-^(^)1 is almost surely finite by Proposition [271 Combining these observations with 
Lemmas [30] and [33l we see that the probability that the third term on the right-hand side of ()96p 
is less than e tends to zero as n — t- oo. Clearly 4/(logn)^ < e for sufficiently large n. To control 
the second term, apply Lemma [T6l with 5„ = Sj+i — Si and = to get 



PI sup \Sn{si+i,t)\lc, > e\ < pI sup \Sn{si+i,t)\ > e Gi\ <4{si+i- Si)e, 

\0<i<Si+i-Si / \0<t<Si+i-Si / 

and thus 

P 2 sup sup |5„(si+i,t)|lG, >2e <4e^(s,+i-sO = 4re. 

V 0<i<m—l 0<t<Si^i—Si J 

Combining these bounds with (196p gives the result. 
Lemma 35. We have 

rTn+s 



□ 



lim PI sup sup 

n^oo V,0<j<m-1 se[s„s,+ i] 



/ y„(s,t) - J(uj(j))l|,<„^,^^j} 





y„(si+i,t) dt 



> 5e 1 = 



Proof. Note that (|9ip still holds in this setting. Consequently, if Sj < s < Sj+i, then on the event 
Hi, we have 

T„+S f'Tn+Si+i 

Yn{s,t) dt - J{Uji^i))l{s<u,u-,} - / Yn{si+l,t) dt 

' ' Jo 

(e-»+i--_l) / e-'Sn{si+i,t)dt + e'^+'-' / e-*5„(s,+i, t) 

Jo Jo 

+ / e~*5„(Si+i,Si+i - S) dt + J(Uj(i))l{s<„^,^^j} + / 



< 



dt 



(97) 
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The bounds ([92]) . ([93]) . and ([95]) also hold. In place of observe that if Sj < s < Sj+i, then 
on the event Hj, 



Tn+S 



e *5„(si+i,Si+i - s) + J(nj(i))l{^<„^.^^j} 



Sn. (S' 



i+li 



1 



T„+s 



+ '^K-{i))l{s<«,(o} 



< |S'„(sj+i,Si+i - s) + -''(^ij{j))l{s<«j.(,)}| + |5'„(sj+i,Si+i - s)e' 



-(T„+s)| 



< sup 



|5n(Si+l,t) + J(Uj(i))l{t>s,+^_„ }| + ^2 sup \Sn{Si+i,t)\. (98) 

1 - »+i jw/i (logn)^ o<t<s,+i-s, 



Also, on the event Hi, 

sup |5„(sj+i,t)| < |J(Mj(i))| + sup \Sn{Si+l,t) - J{Uj(i:^)l{t>Si+^-Uj^^)}\■ (99) 
0<i<Si+i-Si 0<t<Si+i-Si 



Thus, on the event Hi, by (HzD, ((98]), ([Ml), (|92]), dMj), and (195]), we have 



sup sup 

0<i<m— 1 sG[si,Si+i] 

4 

+ 



T„+s 



< 



(logn)2 (logn)2o<s<T 
5 



y„(s,t) - '/(?ij(i))i{s<«,(,)} 

sup |J(s)| 



'i^n('Sj+l,0 fit 



+ I 1 + 

3 



(log n)^ 



sup [^^(Si+l,*) + '/(^^i(i))l{i>Si+i-«^.(i)}| 
0<i<m— 1 0<t<Si+i— Si 



+ 



(logn)^ 



0<s<TJo 



Tn+S 



\Yn{s,t)-Y{s,t)\dt 



+ sup 

o<s<r 



T„+s 



Y{s,t) dt 



+ sup \L{s)\ + sup I J(s) 

0<s<T 0<s<T 



(100) 



By Lemmas [30l and [33l and the fact that supQ<g<j. |L(s)| and supo<g<y | J(s)| are finite almost 
surely, the probability that the first, second, or fourth term on the right-hand side of (jlOOp is 
greater than e tends to zero as n — t- oo. To bound the third term, we apply Lemma [TBI Observe 
that the conditional distribution of 

sup \Sn{Si+l,t) + J{Uj{i))l{t>si+i-Uj^i)]\ 
0<t<Si+i-Si 

given Hi is the same as the conditional distribution of 

sup \Sn{s,t)\ 
0<t<S„ 

given A{9) in Lemma [16] if we take (5„ = Sj+i — Si and 9 = e^. Because 

X-^dx< 3^ (101) 

3 e'^(logn)"' 



P{Hi) < (s,+i - s,) / 

Je- 
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it follows from Lemma [T6l that 

-P(2 sup sup \Sn{si+i,t) + J{uj^i))l{t>s,+i-u,u)}\^H, > '^e] 

V 0<i<m-10<t<s^+i-Si ■'^ ' / 



m-1 ^rp 



■^0 e^(logn)2' 

which tends to zero as n — )• oo. The result follows. □ 
Lemma 36. // s > and h > with < s + h < T , then 

rTn+s+h i-Tn+s 



Jo 



10 

and for sufficiently large n, 

rTn+s+h 



Xn{s + h,t) dt< / Xnis,t) dt + hlogn (102) 
Jo 



j Xn{s + h,t)dt 

> r' ('°«")'^- + ') („- A.Ji±i!. J^) + l) . (103, 

Jo log n n \ \ log n log n J J 

Proof. We have 

logn f'^"^"'^'^ fs + h t \ _j nte~*loglogn 



n 







Xn{s + h,t) dt = ^— Nn{- ,- ]-ne-' ° 



" \ log n ' log n J log n 



logn /■'^"+'^ tCt f s t \ _, nte * log log n , 

Nn , ] — Ue dt 



n Jq \ log n log n / log n 

logn /•'^"+«+^ / _ nte~* log log n \ , 
/ ne ' H I dt 

logn fTn+s+h fs + h t \ 

+ ^ / nJ ,- )dt 

n Jq \logn log ny 



n 







log n ' log n 







^"+^ logn /■^"+^+^ / , nte-*loglogn^ 

A„(s, t) dt / ne H dt 

n Jt„+s V logn ' 



logn ~ / s + h t . 

+ / Nn ,- dt 



n Jq " \ log n ' log n 

logn f'^"^\~r fs + h t + h\ f s t , , 

log n log n / V log n log n , 



n 



(104) 



Because Nn{s,t) < n for all s,t > 0, the integral in the third term on the right-hand side of 
()104p is bounded by hn, so the term is bounded by hlogn. Also, for all s,t,h > 0, we have 
N{s + h,t + h) < N{s, t) because the number of individuals in the population immediately before 
time {s + h) — {t + h) = s — t with descendants alive in the population at time s + h is less 
than or equal to the number of individuals in the population immediately before time s — t with 
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descendants alive in the population at time s. Therefore, the fourth term in the right-hand side 
of (jl04p is nonpositive. Equation (|102p follows. 

Next, we claim that for all s,t,h > 0, we have Nn{s, t) — Nn{s + h,t + h) < n — Nn{s + h,h). 
To see this, note that iV„(s, t) — Nn{s + h,t + h) is the number of individuals in the population 
immediately before time s — t with descendants alive in the population at time s but not at time 
s + h. This is at most the number of individuals in the population at time s that do not have 
descendants alive in the population at time s + h, which is at most n — Nn{s + h,h). Therefore, 
Nn{s, t) — Nn{s + h,t + h) < n — Nn{s + h,h) + 1. The result ()103p follows from this observation 
and ()104p because 

logn 1"'^"+*+'^ / nte~*loglogn\ f te~*loglogn\ 2h 

/ ne -I ] at < [h log n) sup e H < 

n Jt„+s V logn y t>T„ V logn y logn 



for sufficiently large n. 

Proof of Proposition By Lemmas [301 1321 and [331 it suffices to show that 



sup 

0<s<T 



l-Tn+S rTn + s 

/ Xn{s,t)dt + J{s)- / Yn{s,t 

Jo Jo 



dt 



0. 



□ 



(105) 



By Lemmas [191 [Ml and [26l we have 



lim sup P 

n— ^-oo 



Xn{si-,t) — Yn{si,t)\ dt > " "^-,^!^ for some i = 0, 1 



(logn)Vi6 



, J., . . . , 



m 



<limsup + 

+ {l + T(l„g„)-'<^»+^'('°'^">''' 

Therefore, for all e > 0, we have 



n^/^e 



+ 4(logn)2(T„ + r)e, =0 



lim P[ sup 

. 0<i<m 



XniSi,t) dt 



Yn(Si,t) dt 



> £ 



(106) 



We first consider the case in which Gi occurs. If s G [sj, Sj+i], then by ()102p . 



Xn{s,t) < 



Xn{si,t) dt + {s- Si)logn, 



(107) 



and therefore 



Xn{s,t) < 







+ 



Yn{s,t)dt + 

Yn{si,t) dt 

y„(si+i,t) dt 



Xn{si,t) dt 



YniSi,t) dt 







Yn{si+i,t) dt 



Yn{s,t)dt 







+ {s-Si)logn. (108) 
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Likewise, 

and so 

rT„+s 



rT„+s 

Jo 



Xnis,t) > 



Xnisi+i,t) dt - (si+i - s)logn, 



(109) 



X„(s,t) > 







Yn{s,t) dt 



Xn{si+l,t) dt 



y„(sj+i,t) dt 



Tn+S 



Ynis,t) dt 



Yn{si+l,t) dt 

(sj+i - s)logn. (110) 



Since Sj+i — Sj < 2/ (log n)^ for i = 0, 1, . . . , m — 1, it follows that 



sup sup 

0<i<m-l se[si,Si+i] 



T+s 



< h sup 

logn 0<i<m 



Xnis,t)dt 

Tji -\-Si 





T+s 



Yn{s,t) dt 



Xn{Si,t) dt 

Tn+S 



+ 2 sup sup 

0<i<m— 1 se[si,Si+i] 



Ynis,t)dt 



Yn{Si,t) dt 



Yn{Si+i,t) dt 



Clearly 2/ logn < e for sufficiently large n. Therefore, by ()106p and Lemma [Ml 



lim sup P I sup sup 

n.-s>oo \0<j<m-l se[si,Si+i] 



T+s 



T+s 



Xn{s,t)dt- / Yn{s,t)dt 



1g, > lOe < 4Te. (Ill) 



We next consider the case in which Hi occurs. Recall that on Hi, we have G (sj,Sj+i) 



Let 



Ri = lim Nn 



S S — Si 



That is, Ri is the number of individuals in the population immediately before time Sj/logn 
with descendants in the population at least until immediately before time Uj(j)/logn. Note that 
A'"„(s/ log n, (s — Si)/logn) > Ri for all s £ [si,Uj(j)). Therefore, by ()103p . for s G [s.j,Mj(j)) we 
have 



T„+s 



Xn{s,t)dt> / Xr,{Si,t)dt 



2{s-Si) {\ogn){Tn + T){n-Ri + l) 
logn n 



Combining this result with (jl07j) and the fact that Sj+i — Sj < 2/ logn gives 



Xn{s,t)dt 



Xn{Si,t) dt 



^2 4 (logn)(T„ + r)(n-i?, + l) 



log n (log n) 



n 
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It follows that if s G [si,nj(j)), then 



Tn+S 



Xn{s,t)dt 



Tn+S 



2 4 

< : + 



Yn{s,t)dt 

(logn)(r„, + r)(n-i?i + l) 



logn ' (logn)'^ 



XniSi,t) dt 



n 







+ 



Yn{Si,t) dt - J{Uj(i^) 

y„(s,t) dt - J{uj(j_)) - 



Yn{Si,t) dt 



y„(sj+i,t) dt 







Yn{Si+i,t) dt 



(112) 



Likewise, let 



S^ = Nn 



log n ' log n 



and note that A^„(sj+i/ log n, (sj+i — s)/logn) > Si for all s G (tij(j), Si+i]. Therefore, if s G 
(nj(j), Sj+i], then (|103p gives 



Jo 



Xn{s,t) dt < 



Xn{Si+i,t) dt + 



2{si+i-s) , (logn)(T„ + r)(n-5i + l) 



logn 



+ 



n 



(113) 



Because Xn{uj(^i'j,t) = lims^^^t^^.j X„(s,t) for Lebesgue almost all t > 0, equation (IllSp holds for 
all s G Sj+i]. Combining this result with ()109p gives 



^n(s,t) dt 



^2 4 (iogn){Tn + T){n-Si + l) 

- ' /-I \ Q ' 



logn (logn)" 



n 



Therefore, if s G [nj(j), Sj+i], then 



Tn + S 



Xnis,t) dt + J{s) 



2 4 

< + T, 7^ + 



Ynis,t)dt 

{logn){Tn + T){n-Si + l) 



log n (log n)'^ 



n 



+ 



+ 



Xn{Si+i,t) dt 



Tn + S 



i+l 



YniSi+i,t) dt 



Tn+S 



Ynis,t) - Jis) 





Tn+Si^l 



Yn{Si+i,t) dt 



(114) 
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By (jll2p and (|114p . and the fact that J{s) = if s E [sj, tij(j)) U (uji^i), Sj+i], 



sup sup 

0<i<m-l se[si,Si+i] 



dt 



< 



+ 



rT+s rT„+s 
/ Xnis,t)dt + J{s) - / Yn{s,t) 

Jo Jo 

4 (logn)(r„ + r)(2n-i?i-5i + 2)lj^^ 

+ sup 



logn (logn)3 o<i<m-i 

f-Tn+Si 

+ sup / Xn{Si,t) dt 

0<i<m Jo 



2 sup sup 

0<i<m-l se[si,Si+i] 



Yn{Si,t) dt 



Yn{s,t) - J{Uj(i))l{s<u,,,.} 



y„(si+i,t) dt 



Iff.- 

(115) 



We claim that conditional distributions of n — Ri and n — Si given Hi are each stochasticahy 
dominated by the distribution of n — A'„(0, 2/(log n)^), which is the number of blocks lost by time 
2/(log 77.)^ in a Bolthausen-Sznitman coalescent started with n blocks. To see this, note that n — Si 
is the number of blocks lost by time (sj+i — nj(j))/logn < 2/(logn)3 in a Bolthausen-Sznitman 
coalescent. Likewise n — Ri is the number of blocks lost by time {uj(i) — Si)/(logn) < 2/(logn)^ 
in a Bolthausen-Sznitman coalescent started with n lineages, if we disallow the instantaneous 
merger caused by the birth event at time ^ij(j). Because Hi requires that exactly one of the 
falls in (sj, Sj+i), the effect of conditioning on Hi is the same as suppressing all mergers in which 
each lineage participates with probability greater than e^. This can only reduce the number of 
blocks lost. Thus, we have 

E[n-R,m<-^< 



(logn)3 (logn)^ 

by pO|) . By the same argument, E[n— Si\Hi] < 2n/(logn)^. Thus, using (jlOip and the fact that 
m < r(log n)^. 



E 



sup 

0<i<m-l 



(logn)(r„ + r)(2n -R,-Si + 2)1 h, 



n 



<"°«">'J" + ^''gp(g,) 

i=0 



An 



(log n) 



+ 2 < 



2(logn)(I'„ + r)m/ 4n 



e^n{logny \(logn)2 



+ 2 



as n — )• oo. Hence, by Markov's Inequality, 
2 



4 (logn)(r„ + r)(2n - R^ - Si + 2)1 „ 

~\~ sup vJ. 



n 



logn (logn)3 o<i<m-i 
Combining this result with pi5p , (|106p , and Lemma [351 we get 



lim sup P ( sup 



sup 



0<j<m— 1 s6[si,Si-|_i] 



rT+s pT+s \ 

J Xn{s,t)dt + J{s)- J Yn{s,t) dt 1h, > Ue j =0. (116) 



The result (|105p follows from (jllip . (|116p . and the fact that with probability tending to 1 as 
n — )• oo, we have Gj U ffj for z = 0, 1, . . . , m — 1 by (f90|) . □ 
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